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Abstract. We recast the problem of calculating Vapnik-Chervonenkis (VC) den- 
sity into one of counting types, and thereby calculate bounds (often optimal) on the 
VC density for some weakly o-minimal, weakly quasi-o-minimal, and P-minimal 
theories. 
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1. Introduction 

The notion of VC dimension, which arose in probability theory in the work of Vapnik and 
Chervonenkis [98J , was first drawn to the attention of model-theorists by Laskowski , 
who observed that a complete first-order theory does not have the independence property 
(as introduced by Shelah [55]) if and only if, in each model, each definable family of 
sets has finite VC dimension. With this observation, Laskowski easily gave several 
examples of classes of sets with finite VC dimension, by noting well-known examples 
of theories without the independence property. This line of thought was pursued by 
Karpinski and Macintyre [49], who calculated explicit bounds on the VC dimension of 
definable families of sets in some o-minimal structures (with an eye towards applications 
to neural networks), which were polynomial in the number of parameter variables. In a 
further paper |50j , they observe that their arguments also lead to a linear bound on the 
VC density of definable families of sets in some o-minimal structures. They ask whether 
similar (linear) bounds hold for the p-adic numbers (whose theory also does not have 
the independence property). The bound in the o-minimal case in |50j was established 
independently, using a more combinatorial approach, by Wilkie (unpublished), and more 
recently, also by Johnson and Laskowski [47] . 



Date: September 2011. 



2 



ASCHENBRENNER, DOLICH, HASKELL, MACPHERSON, AND STARCHENKO 



In this paper we give a sufficient criterion fTheorem lS.Tp on a first-order theory for 
the VC density of a definable family of sets to be bounded by a linear function in 
the number of parameter variables, and show that the criterion is satisfied by several 
theories of general interest, including the theory of the p-adics and all weakly o-minimal 
theories. In a sequel to this paper [6 we give different arguments to get similar bounds 
in a variety of other examples where our criterion does not apply. Before we state our 
main results, we introduce our setup and review some definitions and basic facts. We 
hope that the present paper (unlike its sequel [B]) can be read with only little technical 
knowledge of model theory beyond basic first order logic. The first few chapters of [12] 
or [63] or similar texts should provide sufficient background for a prospective reader. 

1.1. VC dimension and VC density. Let X be an infinite set and 5 be a non-empty 
collection of subsets of X. Given A C X, we say that a subset i? of A is cut out by S if 
B ^ SnAioi some 5 S 5; we let 5 n A {5 n A : 5 e 5} be the collection of subsets 
of A cut out by S. We say that A is shattered by S if every subset of A is cut out by 
some element of S. The collection S is said to be a VC class if there is a non-negative 
integer n such that no subset of X of size n can be shattered by S. In this case, the VC 
dimension of S is the largest d > such that some set of size d is shattered by S. We 
denote by 7r5(n) the maximum, as A varies over subsets of X of size n, of the numbers 
of subsets of A that can be cut out by S; that is, 

ns{n) max||5n A| : A e 

(Here and below, (^) denotes the set of n-element subsets of X.) The function n i— > 
775(71) is called the shatter function of S. Clearly < nsin) < 2" for every n, and 
if S is not a VC class, then TTs{n) = 2" for every n. However, if 5 is a VC class, of 
VC dimension d say, then by a fundamental observation of Sauer [83] (independently 
made in [57| and, implicitly, in [HH]); the function n TTsin) is bounded above by a 
polynomial in n of degree d. (In fact, for d, n > 1 one has TTs{n) < (en/d)'^, where e 
is the base of the natural logarithm.) Hence it makes sense to define the VC density 
of a VC class S as the infimum of all reals r > such that TTs{n)/n^ is bounded 
for all positive n. It turns out that in many case, the VC density (rather than the 
VC dimension) is the decisive measure for the combinatorial complexity of a family of 
sets. For example, the VC density of S governs the size of packings in S with respect 
to the Hamming metric ([41], see also [64] Lemma 2.1]), and is intimately related to the 
notions of entropic dimension [7] and discrepancy [68] . We refer to the surveys [65l [33] 
for uses of VC density in combinatorics. 

1.2. VC dimension and VC density of formulas. Let £ be a first-order language. 
In an £-structure M, a natural way to generate a collection of subsets of M™ is to 
take the family of sets defined by a formula, as the parameters vary. Given a tuple 
X = (xi, . . . , Xm) of pairwise distinct variables we denote by |a;| := m the length of x. We 
often need to deal with /^-formulas whose free variables have been separated into object 
and parameter variables. We use the notation ip{x; y) to indicate that the free variables 
of the £-formula ip are contained among the components of the tuples x = {xi, . . . , Xm) 
and y = {yi, . . . , y„) of pairwise distinct variables (which we also assume to be disjoint). 
Here the Xi are thought of as the object variables and the yj as the parameter variables. 
We refer to <p(x; y) as a partitioned C-formula. 
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In the rest of this introduction we let M be an infinite >C-structure. Let (p{x; y) be a 
partitioned £- formula, r7i=|a;|,rt=|2;|, and denote by 

= {<^^(M™;6) : 6e A/"} 

the family of subsets of M™ defined by in M using parameters ranging over M". We 
call Sip a definable family of sets (in M). We say that ip defines a VC class in M if 
Sip is a VC class; in this case the VC dimension of tp in M is the VC dimension of the 
collection S^p of subsets of Af™, and similarly one defines the VC density of (p in M. 
Since the shatter function tt^ = ng^ of S^ only depends on the elementary theory of M 
(see Lemma 13.21 below), given a complete >C-theory T with no finite models, we may also 
speak of the shatter function of tp in T as well as VC dimension of ip in T and the VC 
density of tp in T. 

1.3. NIP theories. A partitioned £-formula (p{x; y) as above is said to have the inde- 
pendence property for M if for every t G N there are bi, . . . ,bt € Af" such that for every 
S" C {1, . . . , there is as e Af" such that for alH G {1, . . . , t}, M |= (p{as] h) -^=^ i G 
5*. The structure M is said to have the independence property if some £-formula has 
the independence property for 7W, and not to have the independence property (or to 
be NIP or dependent) otherwise. By a classical result of Shelah [8S] (with other proofs 
in [52l [55l [80]), for M to be NIP it is actually sufficient that no formula ip{x; y) with 
|a;| = 1 has the independence property for M. NIP is implied by (but not equivalent to) 
another prominent tameness condition on first-order structures called stability: An C- 
formula (p(a;; y) is said to be unstable for M if for every t e N there are oi, . . . , Of € Af™ 
and &i, . . . , 6t S A^" such that M \= ip{ai;bj) <^=> i < j, for all i,j S {!,..., t}. The 
^-structure M is called unstable if some £-formula ^p is unstable for M; and "stable" 
(for formulas and structures) is synonymous with "not unstable." 

Laskowski's observation [55] is that an £-formula defines a VC class in M if and 
only if it does not have the independence property for M. In fact, given a collection iS 
of subsets of a set X, define the dual shatter function of S as the function n i— > 7r^(n) 
whose value at n is the maximum number of equivalence classes defined by an n-element 
subfamily T of S, where two elements of X are said to be equivalent with respect to T 
if they belong to the same sets of T. Then a given partitioned £-formula fix; y) has the 
independence property precisely if ttJ^ (n) = 2" for every n. The dual shatter function 
of Sip is really a shatter function in disguise: it agrees with the shatter function of 
where (p*{y; x) := (p{x; y) is the dual of the partitioned formula (p. (See Section |3l) 

A complete £-theory T is said to have the independence property if some model of it 
does, and is said not to have the independence property (or to be NIP) otherwise. Thus 
a complete £-theory T is NIP if and only if every £-formula defines a VC class in every 
model of T. Many theories arising in mathematical practice turn out to be NIP: By [55] . 
all stable theories (i.e., complete theories all of whose models are stable) are NIP; so, 
for example, algebraically closed (more generally, separably closed) fields, differentially 
closed fields, modules, or free groups furnish examples of NIP structures. Furthermore, 
o-minimal (or more generally, weakly o- minimal) theories are NIP [55l [6T] . By [36] any 
ordered abelian group has NIP theory. Certain important theories of henselian valued 
fields are NIP, for example, the completions of the theory of algebraically closed valued 
fields and the theory of the field of p-adic numbers (and also their rigid analytic and 
p-adic subanalytic expansions, respectively). In fact, in the language of rings with a 
predicate for the valuation ring, an unramified henselian valued field of characteristic 
(0,p) is NIP if and only if its residue field is NIP [12]. Similarly, henselian valued fields 
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of characteristic (0, 0) and algebraically nraximal Kaplansky fields of characteristic (p, p) 
are NIP iff their residue fields are NIP [T31IT2]. 

On the other hand, each pseudofinite field (infinite model of the theory of all finite 
fields) is not NIP [29j, since it defines the (Rado) random graph. 

1.4. Uniform bounds on VC density. This paper is motivated by the following 

question: Given a NIP theory T , can one find an upper hound, in terms of n only, on 
the VC densities {in T) of all C-formulas ip{x;y) with \y\ = n? The intuition behind 
this question is, of course, that the complexity of a family of sets defined by a first- 
order formula ^{x; y) in a NIP structure should be governed by the number n of freely 
choosable parameters. Note that the minimum possible bound is \y\ — n: for if (p{x; y), 
where a: is a single variable, is the formula x = yi V • • • V a: = then the subsets of M 
cut out by Stp are exactly the non-empty subsets of M of cardinality at most n, so (p{x; y) 
has VC density n (in any complete theory) . We note here in passing that the VC density 
of a formula ip in a NIP theory may take fractional values, and that the shatter function 
of Sip, though not growing faster than polynomially, is not asymptotic to a real power 
function in general. See Section |4] below, where we explicitly compute the VC density 
of certain incidence structures (related to the Szemeredi- Trotter Theorem) and of the 
edge relation in Spencer-Shelah random graphs, and investigate the asymptotics of a 
shatter function in the infinitary hypercube. 

In this paper we employ VC duality to translate the problem of bounding the VC den- 
sity of a formula (p into the task of counting (/3*-types over finite parameter sets, which 
then can be treated by model-theoretic machinery. Viewing VC density as a bound on 
a number of types also illuminates the connection with a strengthening of the NIP con- 
cept, which is that of dp -minimality. (See Section 15.31 below for a definition.) Dolich, 
Goodrick and Lippel [24] have observed that, if, in a theory, the dual VC density of 
any £-formula in a single object variable is less than 2, then the theory in question is 
dp-minimal. (No counterexample to the converse of this implication seems to be known.) 

We now state our main results. First, an optimal bound on density is obtained for 
weakly o-minimal theories (see Theorem 16.11 below). Recall that a complete theory T 
in a language containing a binary relation symbol "<" which expands the theory of 
linearly ordered sets is called weakly o-minimal if in every model of T, each partitioned 
£-formula ip{x;y) with |a;| = 1 defines a finite union of convex sets. (See |61) for more 
on this notion, which generalizes the probably more familiar concept of an o-minimal 
theory, cf. [25].) 

Theorem 1.1. Suppose C contains a binary relation symbol "<", interpreted in M as 
a linear orderiny. If T — Th{M) is weakly o-minimal, then every C-formula 'p{x] y) 
has VC density at most n —\y\ inT {in fact, Trip{t) = 0{t"')). 

This bound is the same as that obtained by Karpinski-Macintyre [49] for o-minimal 
expansions of the reals, or by Wilkie and by Johnson-Laskowski [47| for all o-minimal 
structures. The motivating example of a theory which is weakly o-minimal but not o- 
minimal is the theory of real closed valued fields, that is, real closed fields equipped with 
a predicate for a proper convex valuation ring. In fact, the methods of Karpinski and 
Macintyre can also be adapted to give the correct density bounds for this and certain 
other weakly o-minimal expansions of real closed fields [40) . Some interesting weakly 
o-minimal theories to which these methods do not readily adapt may be found in [3 [54] . 
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Our approach to Theorem II. ![ via definable types, was partly inspired by the use of 
Puiseux series in [TTl |8T] . 

Let ACVF denote the theory of (non-trivially) valued algebraically closed fields, 
in the ring language expanded by a predicate for the valuation divisibility. This has 
completions ACVF(o,o) (for residue characteristic 0), ACVF(o.p) (field characteristic 0, 
residue characteristic p), and ACVF(j,^p) (field characteristic p). Because ACVF(o_o) is 
interpretable in RCVF, our methods give (non-optimal) density bounds for ACVF(o,o) 
fCorollarv l6.3p . However, they give no information on density in the theories ACVF(o,p) 
and ACVF(p p). The problems arise essentially because a definable set in 1-space in 
ACVF is a finite union of 'Swiss cheeses' but we have no way of choosing a particular 
Swiss cheese. This means that the definable types technique in our main tool (Theo- 
rem [5?7]) breaks down. On the other hand, our methods do yield: 

Theorem 1.2. Suppose M = Q,p is the field of p-adic numbers, construed as a first- 
order structure in Macintyre's language Cp. Then the VC density of every Cp- formula 
(p{x; y) is at most 2\y \ — 1. 

The same result holds for the subanalytic expansions of Qp considered by Denef and 
van den Dries [22]. (Theorem 17.21 and Remark 17.91 ) Key tools available here, but 
not in the case of ACVF, are cell decomposition and the existence of definable Skolem 
functions. We do not know whether the bound in Theorem 11.21 is optimal. 

The investigation of the fine structure of type spaces over finite parameter sets in 
NIP theories is only just beginning, and the present paper can be seen as a first step in 
studying one particular measure (VC density) for their complexity. Applications of the 
results in this paper to transversals of definable families in NIP theories will appear in 
a separate manuscript, under preparation by the first- and last-named authors. 

As remarked above, all stable theories are NIP, so it also makes sense to investigate 
VC density in stable theories. In a sequel of the present paper [6] we obtain bounds 
on VC density in certain finite U-rank theories (including all complete theories of finite 
Morley rank expansions of infinite groups). 

We close off this introduction by pointing out that besides being of intrinsic interest, 
uniform bounds on VC density of first-order formulas (as obtained in this paper) often 
also help to explain why certain well-known effective bounds on the complexity of geo- 
metric arrangements, used in computational geometry, are polynomial in the number of 
objects involved. For example, the bound on the number of semialgebraically connected 
components of realizable sign conditions on polynomials over real closed fields from 
181] breaks up into a topological and a combinatorial part, where the polynomial 
nature of the latter may be seen as a consequence of Theorem 11.11 

Example. Let i? be a real closed field, V = {Pi, . . . , P^) be a tuple of polynomials from 
R[X] = R[Xi, . . . ,Xk], each of degree at most d. A sign condition for V is an s-tuple 
ae{ — 1,0,+1}'', and we say that a is realized in a subset V of R'' if 

av ■■= {aeV : (signFi(a), . . .,signPs{a)) = a} 

is non-empty. Theorem 11.11 in the semialgebraic case yields: if V is an algebraic set 
defined by polynomials of degree at most d, then the number of sign conditions for V 
realized in V is at most Cs™, where m = dim{V) and the constant C = C{d,k) only 
depends on d and k. 
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To see this recall that by cell decomposition, y is a finite union of semialgebraic 
subsets of i?*^ each of which is semialgebraically homeomorphic to some i?"; moreover, 
this decomposition (and the resulting homeomorphism) can be chosen uniformly in the 
parameters: Every zero set of polynomials from R[X] of degree at most d is the zero set 
of M such polynomials, where M — {''^'^) is the dimension of the i?-linear subspace of 
R[X] consisting of the polynomials of degree at most d; thus we may take a semialgebraic 
(in fact, algebraic) family {Vb)b^iiN, where N — M^, whose fibers Vb are the algebraic 
subsets of defined by polynomials of degree at most d. Then there are finitely many 
semialgebraic families {V^^'')b£RN of subsets of R^ and for each i there is a semialgebraic 
family (-F]|*^)hgfijv of maps such that for each b £ R^ we have Vb = IJj Vf,^*\ and F^^*^ is 
a homeomorphism i?™'"' — > V^^\ for some m'*^ 

Fix some i and write m =^ m^'-*. Let ly — {yi, . . . ^v^) range over N'"', with \v\ = 
1^1 + • • • + t'fe, and suppose y = {yu)\v\<d^ so y has length M. Let P{X] y) be the general 
polynomial in the indeterminates X of degree at most d with coefficient sequence y; so 
every Pj is of the form Pj — P{X; bj) with bj e i?*^. Suppose also x — (xi, . . . ,Xm), 
and let z be a tuple of new variables of length N, let z' be a single new variable, and 
let i^*-*-* {x; y, z, z') be a formula in the language of ordered rings which expresses that 
P{Fz^\x);y) and z' have the same sign. So, e.g., for a e -R™, b £ i?^ we have R |= 
(/j*^*^ (a; bj, b, 1) iff Pj (-F'fl*'' (a)) > 0. In this way we see that the number of sign conditions 
for V realized in V^^^ is bounded by tt*;;, (3s) and thus is 0(s™) by Theorem ll.il where 
the implicit constant only depends on ip^^^ and hence on d and k. This yields the claim 
highlighted above. (Of course we have been very nonchalant with the constants. Indeed, 
|llj shows the more precise result that the sum of the number of semialgebraically 
connected components of the sets uy, where a ranges over all sign conditions for V 
realized in V, is bounded by (©(d))*^ (^^) .) 

A simpler example is the number of non-empty sets definable by equalities and in- 
equalities of a finite collection of polynomials over an algebraically closed field: 

Example. Here we let v = (z/i, . . . , Vm) range over N™, and suppose y — {yu)\L'\<d- Let 
^{x; y) be the partitioned formula 

\v\<d 

in the language C of rings, and fix an algebraically closed field K. Then Sip = is the 
collection of all zero sets (in K"^) of polynomials in m indeterminates with coefficients in 
K having degree at most d. Hence tt^^ {t) is the maximum number of non-empty Boolean 
combinations of t such hypersurfaces. In the sequel of our paper (see [6l Theorem 1.1]) 
we will show that the shatter function of any partitioned /^-formula with m parameter 
variables (such as ip*) is C'(t'") in Th(i4r); hence 7r*(<) = TT^.{t) = 0(1""). (In fact, gS] 
proves that tt* (t) < X]fc=o (D^*^ every t, and this bound is asymptotically optimal.) 

1.5. Organization of the paper. In the preliminary Section [2] we set the scene by 
recalling the definitions and basic facts concerning VC dimension and VC density in 
a general combinatorial setting. In Section [3] we then move to the model-theoretic 
context; in particular we introduce the VC density function of a complete theory without 
finite models, and the (dual) VC density of a finite set of formulas. In Section |4] we 
give some interesting examples of formulas in NIP theories for which we can explicitly 
compute their VC density or determine the asymptotic behavior of their shatter function. 
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In Section [5] we introduce the VC d property (a refinement of Guingona's notion of 
uniform definability of types over finite sets) and get our main tool for counting types 
(Theorem 15.71) in place, which is then employed, in Section [51 to prove Theorem 11.11 
from above. A strengthening of the VCd property is defined and established for the 
p-adics in Section [71 thus proving Theorem 11.21 We refer to the introductions of each 
section for a more detailed description of their contents. 

1.6. Notations and conventions. In this paper, d, k, m, n range over the set N := 
{0, 1,2,...} of natural numbers. We set [n] := {1, . . . , n}. Given a set X, we write 2-^ 
for the power set of X, and we let (^J denote the set of n-element subsets of X and 
(^J :— ('^) U ('^j U • • • U ('^) the collection of subsets of X of cardinality at most n. 
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authors were participating in the thematic program on 0-minimal Structures and Real 
Analytic Geometry at the Fields Institute in Toronto (Spring 2009), and in the Durham 
Symposium on New Directions in the Model Theory of Fields (July 2009), organized by 
the London Mathematical Society and funded by EPSRC grant EP/F068751/1. The 
support of these institutions is gratefully acknowledged. Aschenbrenner was partly 
supported by NSF grant DMS-0556197. He would also express his gratitude to Gerhard 
Woginger for suggesting the example in Section 14.4.11 and to Andreas Baudisch and 
Humboldt-Universitat Berlin for their hospitality during Fall 2010. Haskell's research 
was supported by NSERC grant 238875. Macpherson acknowledges support by EPSRC 
grant EP/F009712/1. Starchenko was partly supported by NSF grant DMS-0701364. 

2. VC Density 

In this section we introduce various numerical parameters associated to abstract families 
of sets: VC dimension, VC density, and independence dimension, and we recall the 
well-known phenomenon of "VC duality" hinted at already in the introduction (which, 
in particular, allows us to relate VC dimension with independence dimension). An 
important role in later sections is played by a new parameter associated to a set system 
defined here, which we call breadth, and which is the focus of the last part of this section. 

2.1. VC dimension and VC density. A set system is a pair {X,S) consisting of a 
set X and a collection S of subsets oi X. We call X the base set of the set system {X, S), 
and we sometimes also speak of a set system S on X . Given a set system (X, S) and a 
set A C X , we let S n A -.^ {S n A : S e S} and cah {A,S n A) the set system on A 
induced by S. Let now 5 be a set system on an infinite set X. The function tt^ ; N — > N 
given by 

TTsin) max ||5 n A\ : A e 

is called the shatter function of S. We have < TTs{n) < 2" and Trs{n) < 7r5(n + 1) for 
all n. Note that if Y D X then does not change if S is considered as a set system 
on Y. (This justifies our choice of notation for the shatter function, suppressing the 
base set X of our set system.) 

One says that A C X is shattered by S if S Ci A ^ 2"^ . If 5 is non-empty, then we 
define the VC dimension of S, denoted by VC(iS), as the supremum (in NU {oo}) of 
the sizes of all finite subsets of X shattered by S; so VC(5) — oo means that arbitrarily 
large finite subsets of X can be shattered by S. Equivalently, 

VC(5) = sup {n : TTs{n) = 2"}. (2.1) 
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One says that 5 is a VC class if VC(5) < oo. Note that some sources (e.g., |55j ) 
alternatively define the VC dimension of S to be the minimum n such that no set of 
size n is shattered by S (i.e., VC{S) + 1, with YC{S) as given by (|2.ip V 

We have the following fundamental fact about set systems: 

Lemma 2.1 (Sauer-Shelah). IfS has finite VC dimension d {so TTs{n) < 2" for n > d), 
then 

''^^"^ - (< d) (o) + ■ ■ ■ + (d) '"'"^ 

If n > d, then (^*^) is bounded above by [en/d)'^ (where e is the base of the natural 
logarithm). In particular, either usin) — 2" for every n (if S is not a VC class), or 
T^sin) = Oin'^). One may now define the VC density vc(5) of S as the infimum of all 
real numbers r > such that T^s{n) = 0{n^), if there is such an r, and vc(5) := oo 
otherwise. That is, 

r<:\ r log7r5(n) 
vc(6) — Imisup — . 

n^oo log n 

We also define VC(0) := vc(0) -1. Then vc(5) < VC(5) by Lemma O and 
vc(5) < 00 iff VC(5) < oo. The VC density of S is also known as the real density [7] or 
the VC exponent [17] of S. It is related to the combinatorial dimension of S introduced 
by Blei [8] and to compression schemes for S [47] . 

Example. Suppose S — {^j}- Then the inequality in the statement of Lemma l2. II is an 
equality, and VC(5) = vc(5) — d. 

Example. Suppose X = R'', and S is the collection of all closed affine half-spaces in M.'^, 
i.e., sets of the form {x G R'^ : {x, a) > (3} where a G R'^, /3 S M, and ( , ) denotes the 
usual inner product on R'^. Then VC{S) ~ d + 1. (The proof of this fact is based on 
Radon's Theorem on convex sets; see [7] CoroUaire 3.5].) Moreover, vc(5) = d: in fact, 
TTsin) = 2X;to(-l)''~*(<i) for every n; see [30| Theorem 3.1]. 

Example. Suppose X — M., k > 1, and let S be the collection whose members are 
the unions of k disjoint (open) intervals in R. Then VC{S) = vc{S) — 2k, in fact, 
T^sin) — (<2fc) for each n. (See [28, Exercise 11, Chapter 4].) 

In all three examples, tts is actually given by a polynomial of degree d — vc(5). It 
is worth pointing out that for a VC class S, in general is not even asymptotic to a 
real power function; see Section 14.41 below. 

Clearly, VC and vc are increasing: if 5 C T C 2^^, then irg < ttj- and so VC(iS) < 
VC(T) and vc(iS) < vc(T). If X' is an infinite subset of X then tt^hX' < t^s', rnore 
generally (see [7] Proposition 2.2]): 

Lemma 2.2. Let X' he an infinite set and f : X' ^ X be a map, and let f^^{S) := 
{f^^{S) : S G S}. Then Trj-i^g-) < irg, with equality if f is surjective. In particular, 
YC{f^^{S)) < YC{S) and Yc(f^^{S)) < vc{S), with equality if f is surjective. 

It is easy to verify that VC(iS) = if and only if \S\ = 1, and vc{S) = if 5 
is finite; in fact, the converse of the latter implication also holds: if vc(5) < 1, then 
S is finite [T] Proposition 2.19] (and hence actually vc(5) = 0). It is also easy to 
verify (cf. [T] Proposition 2.4]) that if Si, S2 are subsets of S with S = SiU S2, then 
vc(iS) = max{vc(iSi), vc(iS2)}. In particular, vc(5) does not change if we alter finitely 
many sets from S. 
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2.2. Independence dimension. Let X be a set. Given subsets Ai,...,An of X, 
we denote by S{Ai, . . . , An) the set of atoms of the Boolean algebra of subsets of X 
generated by Ai, . . . , An (the "non-empty fields in the Venn diagram of ^i, . . . , A„"); 
that is, S{Ai, . . . , An) is precisely the set of non-empty subsets of X of the form 

PjAiH Pi X\Ai where / C [n] = {1, ... 

iei ie[n]\I 

Note that 5(^1, . . . , An) does not depend on the particular order of the Ai, so sometimes 
we abuse notation and, e.g., write S{Ai : i — 1, . . . ,n) instead of S{Ai, . . . , A„). We 
have < |5'(Ai, . . . , A„)| < 2", and we say that the sequence Ai, . . . , An is independent 
(in X) if 15(^1, . . . , An)\ — 2", and call Ai, . . . , An dependent (in X) otherwise. 
Suppose now that 5 is a collection of subsets of X. We define tt^ : N — > N by 

7r*(n) :=max{|5(Ai,...,A„)| : Ai, . . . , A„ e 5}. 

Note that < 775(71) < 2" for each n. We say that S is independent (in X) if 7r|(n) — 2" 
for every n, that is, if for every n there is an independent sequence of elements of S of 
length n. Otherwise, we say that S is dependent (in X). If S is dependent, we define 
the independence dimension IN(5) of S as the largest n such that T^^in) — 2", and if S 
is independent, we set IN (5) =00. If 5 is finite, then clearly IN(5) < \S\. 

Example 2.3. IN(5) < 1 iff for aU S,S' eS one of the following relations holds: SnS' = 
0, S* C S', S' C S, or SUS' = X. 

The function tt^ is called the dual shatter function of S, since (for infinite S) one 
has TTg — TTs- for a certain set system S* on X* = S, called the dual of S (cf. [71 
2.7-2.11] or [66] Section 10.3]). For the same reason, the independence dimension of 
iS is sometimes also called the dual VC dimension of S, denoted by VC*(iS). The 
correspondence between S and S* is explained in the following subsection. 

2.3. VC duality. Let X and Y be infinite sets, and let $ C X x F. For y eY we put 

$y ■.= {xeX:{x,y)e<i>}, 

and we set 

5$ := {%:yeY}C 2^. 

We also write 

C F X X := {(y,x) G Y x X : {x,y) G 

for the dual of the binary relation $. In this way we obtain two set systems (X, 5$) 
and (y,5$-»). To simplify notation, we denote the shatter function of 5$ by 7r$, and 
its dual shatter function by ttJ; similarly for $* in place of $. One verifies easily that 
given a finite set A C X, the assignment 

xeA' xeA\A' 

defines a bijection 

S^nA^ 5($; -.xeA). 

This implies: 

Lemma 2.4. 7r$ = 7r|,.. 
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We set VC(<i>) := VC(5$), and similarly with IN and vc in place of VC. By the pre- 
vious lemma, VC(<1') — IN($*), hence 5$ is a VC class iff 5$. is dependent. Reversing 
the role of $ and $* also yields 7r$. = 7r|,, hence VC($*) = IN($), and 5$* is a VC 
class iff 5$ is dependent. The following is also well-known (see, e.g., [T] 2.13 b)]): 

Lemma 2.5. VC($) < 2i+vc;(**)_ (-j^ particular is a VC class iff S^. is a VC 
class.) 

Example 2.6. Suppose 5$ is finite (i.e., vc($) = 0). Then 5$* is also finite. (Take 
2/1, . . . , j/AT e Y, where N = such that 5$ ~ {^yi, ■ ■ ■ , Let Xi = ^y. and 

Yi = {y G Y : = ^y.} for i S [TV]; thus $ = Xi x Yi U • • • U Xn x Yn. Hence for each 
a; £ X, $* is a union of Fi, . . . , Fat, and so there are only finitely many choices for i>*. 
Thus 5$* is also finite, of size at most 2^.) 

Clearly every infinite set system 5 on X is of the form 5 = 5$ for some infinite set Y 
and some binary relation ^ C X x Y: just take Y = S, ^ — {(x, S) : x G S, S £ S}. 
The resulting set system 5$. onY = S is called the dual S* of S in [66l Section 10.3]. 
By the above VC(5*) — VC*(iS). If 5 is a dependent infinite set system on X, then 
by Lemmas 12.11 and 12. 4[ there is a real number r > such that tt^ = 0{rf), and the 
infimum of all such r is called the dual VC density of 5, denoted by vc*(iS); note that 
Yc{S*) = YC*{S) and vc*(5) < VC*(5). 

Given $ C X x F we write ^<i> for the relative complement (X x F) \ $ of <I> in X x F . 
We clearly have tt*^ = 7r|,. It is also easy to show that given $, 5* C X x F we have 
ttJu^, < 7r|> • tt^ and hence (using complementation) 7r|,p^ < 'i'^ • tt^- By passing to 
duals and Lemma [2T4l this yields: 

Lemma 2.7. Lei * C X x F. T/ien 

vc(^<i>) = vc(<i>), vc(<I> U < vc($) + vc(v['), vc($ n < vc($) + vc(vl/). 

VC dimension does not satisfy a similar subadditivity property for unions and inter- 
sections (cf. [27l Proposition 9.2.8]). In this way, VC density is better behaved than VC 
dimension. 

An important class of relations $ such that the associated set system 5$ is dependent 
are the stable ones. An n-ladder for $ is a 2n-tuple (ai, . . . , a„, 6i, . . . , 6„) where each 
Qi G X and each bj e F, such that for all i,j € [n], 

(a„6j)e$ i<j. 

If there is an n such that there is no n-ladder for then 4> is called stable, and $ is 
said to be unstable otherwise. If ^ is stable then the largest n such that an n-ladder 
for $ exists is called the ladder dimension of <I>; if $ is unstable then we say that the 
ladder dimension of $ is infinite. Clearly if $ is stable then 5$ is a VC class (with 
VC dimension bounded by the ladder dimension). It is well-known that $ is stable iff 
$* is stable (e.g, [88l Exercise II. 2. 8]), and that Boolean combinations of stable relations 
are stable. 

2.4. Breadth. In many cases of interest complicated set systems are generated by sim- 
pler collections of subsets, and then the following lemma (essentially due to Dudley) 
can be used to show that the resulting set system is dependent. For this let X be a set 
and B he a, collection of subsets of X. 
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Lemma 2.8. Let N > and suppose S is a set system on X such that each set in S 
is a Boolean combination of at most N sets in B. Then TT^{t) < 7rg(iVt) for each t. [In 
particular, if B is dependent then so is S .) 

Proof. Let Ai,...,At £ S, and let each Ai be a Boolean combination of the sets 
Bii, . . . , BiN g B. Then the Boolean algebra of subsets of X generated by the sets Ai 
[i £ [t]) is contained in the Boolean algebra generated by the sets Bij {i e [t], j £ [N]), 
and every atom of the former Boolean algebra contains an atom of the latter. □ 

Suppose there is a d > such that every non-empty intersection i?i n • • • n Bn of 
n > d sets from B equals an intersection of a subset consisting of d of the Bi. We call 
the smallest such integer d > the breadth of B. This choice of terminology is motivated 
by lattice theory: Given a (meet-) semilattice (i,A), the smallest d > (if it exists) 
such that any meet &i A • • • A 6„ of n > d elements of L equals the meet of d of the 
bi is called the breadth of i; if there is no such d we say that L has infinite breadth. 
(See [ini Section II. 5, Exercise 6, and Section IV. 10].) So if B is closed under (finite) 
intersection and only contains non-empty subsets of X, then the breadth of B, viewed 
as a sub-scmilatticc of (2^,n), agrees with the breadth of B as defined above. Every 
set system of finite breadth is dependent: 

Lemma 2.9. breadth(S) > m{B). 

Proof. Suppose d := brcadth(S) < n := IN{B). Let Bi,...,i?„ £ B such that 
|5(Bi,...,B„)| = 2". Choose / C [n] with |/| = d and f]^eI B^ - f]^e[n]B^, and 
take J £ [n] \ I. Then n^eln]\{J} B^ = n^e[„] and hence {X \ B,) n n,e[n]\{j} B^ = 0, 
contradicting IN(i3) = n. □ 

The previous two lemmas in combination with Lemma 12.11 immediately yield the 
following useful fact (cf. [25l Chapter 5, Lemma 2.6]): 

Corollary 2.10. Suppose B has breadth d, let N > 0, and let S be a set system on X 
with the property that each set in S is a Boolean combination of at most N sets in B. 
Then 




for every t. 



In particular, TT%{t) = 0{t'^) and hence ^rc*{S) < d. 

Example 2.11. Let < be a linear ordering on X. We first recall some terminology: A 
subset 5 of X is said to be convex (with respect to <) if for all s, s' g 5 and x £ X the 
implication s<x<s'^x£S holds. So and singleton subsets are convex, as are 
intervals in X. Here and in the rest of the paper, an interval in X is a subset of the 
form 

{a,h) := {x £ X : a < X < b} 

where a, b are elements of X U {±00} with a < b. Other examples of a convex subset 
of X are its initial segments: a subset 5' of X is an initial segment of X if for all s g S' 
and X £ X, the implication x < s ^ x £ S holds. Now let S be the family of unions 
of at most N convex subsets of X, for some given X G N, and let B be the collection 
of all initial segments of X. Then B has breadth 1, and every set in 5 is a Boolean 
combination of at most 2X sets in B. Thus Tig{t) = 0{t) by Corollarv l2.10l 
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Example 2.12. Let K he a field and v. K ^ Too := F U {00} be a valuation on K . By 
an open hall in K we mean any subset of K of the form {x ^ K : v{x — a) > 7} where 
a e A', 7 e Too] similarly a set of the form {x € K : v{x — a) > j} is called a closed ball 
in K. A ball in K is an open or a closed ball in K . Any two given balls in K are either 
disjoint, or one contains the other. Hence the collection B of balls in a given valued field 
has breadth 1. Thus if S is the family of all Boolean combinations of at most N balls 
in for some N €N, then 7r|(t) = 0{t). 

The preceding examples can be subsumed under the following general example (in- 
spired by [2]): 

Example 2.13. A family B of subsets of X is said to be directed if B has breadth 1; i.e., 
for all B,B' gB with B n ^ one has B C B' 01 B' C B. UBC 2^ is directed and 
S is the family of Boolean combinations of at most N sets in B, for some N E N, then 
7r*sit) - 0{t). 

We also note: 

Example 2.14. Let G be a group and let H he a collection of subgroups of G with 
breadth d. Let B — {gH : g £ G,H £ H} he the set of all (left) cosets of subgroups 
from T-L. Then B also has breadth d. This follows from the general fact that if , . . . , 7J„ 
are subgroups of G, (71, ...,(/„ S G, then the intersection riieM f*-^* either empty or 
a coset of niefn] ^i- (So if 5 is a family of Boolean combinations of at most N elements 
of B, for some N eN, then tt* (t) = Oit"^).) 

In connection with the previous example it is worth recording: 

Lemma 2.15 (Poizat). Let G be a group and lefH be a collection of subgroups of G. 
Then breadth(-H) = IN(-H). 

Proof. By Lemma [2.91 we already know that breadth(H) > IN(H). Suppose this in- 
equality is strict. Then there are Hi, . . . , Hn+i G H, where n — IN(?^), such that 
r\ie[n+i] ^ ^ieln+i]\{j} Hi for each j e [n + 1]. So for each j <E [n + 1] we may take 

9j £ {y\ie[n+i]\{j} \ Hi- Then for every subset / of [n-|-l] the element gi := Hie/^i 
(with .90 = 1) is in nj6[„+i]\/ n nj6/(G' \ H,). This contradicts IN(-H) = n. □ 

Example. Let S be the collection of all subgroups of (Z, -|-). Then S has infinite breadth, 
hence infinite independence dimension by the previous lemma, and thus is not a VC class 
by Lemma 12.51 In particular, the collection of arithmetic progressions a + &Z (a, 6 G Z) 
in Z is also not a VC class. 

If our family B has finite breadth d, then the Helly number of B is at most d. The 
Helly number of B is defined as the smallest d > such that every finite subfamily 
. . . , Bn] of B with n > d which is d-consistent, is consistent, that is to say: if for 
every / G ('^') we have Hie/ 0) then fliGl™] ^ ^' ^°te however that conversely, 
the breadth may be infinite yet the Helly number finite, even in the case of cosets: the 
collection of arithmetic progressions in Z is independent, but has Helly number 2. Also, 
not every VC class has finite Helly number: the family whose members are the subsets of 
E with two connected components, though a VC class (of VC dimension 4), has infinite 
Helly number. (For each n the elements [0, i) U (i + 1, n], i = 0, . . . , n — 1 of this family 
form an n — 1-consistent subfamily which is inconsistent.) 

The following example is a prototype for finite-breadth families when we have a 
dimension function at our disposal: 
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Example 2.16. Define the height of B to be the largest d (if it exists) such that there 
are Bi, . . .,Bd&B with 

Si 2 Bi n B2 2 • • • 2 -Bi n • • • n 7^ 0- 

So B has height iff B does not contain a non-empty set, and B has height 1 iff B does 

contain a non-empty set, but any two distinct elements of B arc disjoint. Clearly if B 
has height d > 0, then the breadth of B is at most d. If B has height d > 1 and in 
addition B has a largest element (with respect to inclusion) then the breadth of B is 
smaller than d: to sec this \ct Bi, . . . ,Bd d B with Hielti] 7^ be given; if Bi is the 
largest element B of B then clearly Hieid] ~ nie[d]\{i} ^^'^ otherwise we have a 
chain 

B D Bi D Bi n B2 2 ■ ■ ■ 2 Bi n ■ ■ ■ n Bd ^ <D, 

hence flieb] = fliey+i] and so fliefd] Bi = ClieldWiJ+i} for some j &[d- 1]. 

The following observation (the proof of which we leave to the reader) allows us to 
produce new finite- breadth set systems from old ones: 

Lemma 2.17. Let B, B' be set systems on X and X', respectively, and consider the set 
system 

B^B' -.^ {B X B' : B e B, B' e B'} 

onXxX'. Then 

brcadth(S M B') < breadth(6) + brcadth(B'), 

and this inequality is an equality if both B and B' have breadth larger than 1 and contain 
a largest element {with respect to inclusion). 

This lemma immediately yields: 

Corollary 2.18. Let B, B' be set systems on X. Then the set system 

BnB' := {BOB' : B e B, B' e B'} 

on X has breadth at most brcadth(B) + brcadth(B'). 

Example. Suppose < is a linear ordering of X and B is the collection of convex subsets 
of X. Every element of B can be expressed as an intersection of an initial segment of 
{X, <) with a final segment of {X, <) (i.e., an initial segment of the linearly ordered set 
[x, >)). Hence breadth(B) = 2. 

If S is a sublattice of (2^, n, U) which does not contain and X, then B and the set 
system -iB := {X\B : B £ B} have the same breadth; this is an immediate consequence 
of the following lemma: 

Lemma 2.19. Suppose B is closed under (finite) intersections and, unions, and B does 
not contain the empty set. Then for each d the following are equivalent: 

(1) For all Bi,. . . , Bd+i G B there is some i € [d+V\ such that ^ Bi,- 

(2) for all B\,. . . , Bd+i G B there is some i G [d+1] such that Bi C IJj^^i Bj . 

Proof To see (1) ^ (2) apply (1) to Bl = Bj {i€[d+ 1]) in place of the 5,, and 
for the converse implication apply (2) to B" = Hj^i Bj {i G [d + 1]). □ 
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We finish our discussion of breadtli by a surprising connection between breadth and 
stability. We will not use this observation later in the paper, but we include it here 
since it shows, under the assumption of stability, the ubiquity of set systems of infinite 
breadth. The breadth of a relation between two sets is by definition the breadth of the 
associated set system, cf. Section [2731 

Proposition 2.20. Let X , Y be infinite sets and ^ C X xY be a relation. //vc($) > 
then $ is unstable, or at least one of $ or ^$ has infinite breadth. 

At the root of Proposition 12.201 is a theorem of Balogh and Bollobas [10], which we 
explain first. For this we need some additional terminology: Let {X,S) and {X',S') be 
set systems. We say that {X,S) contains {X',S') as a trace if there exists an injective 
map f : X' ^ X such that f{S') C S D f{X'). For example, if {X^S) is a set system 
and X then {X,S) trivially contains (A,5n A). Also, if {X,S) contains 
and {X',S') contains {X",S"), then {X,S) contains {X",S"). 

For fc > 2 consider now the following set systems on [k] : 

Ck = {[i] '■ i ^ [k]} (the k-chain) 

Sk ^ {{i} ■■ i & [k]) {the k- star) 

% = {[k]\{i} : i e [fc]} (the k-costar). 

Balogh and Bollobas [TOl Theorem 1] showed that these set systems are unavoidable 
among sufficiently large set systems. More precisely: for all integers fc, Z,m > 2 there 
is some N — N{k,l,m) such that every set system S on a finite base set with \S\ > N 
contains the k-chain, the l-star, or the m-costar. (Note that there is no condition on 
the size of the base set in this statement.) 

Proof of Provosition \2.20[ Let 5 = 5$. We first observe, for k > 2: 

(1) S contains Ck iff there is a fc-ladder for <i>; 

(2) if breadth(<i>*) > k then S contains Tk', and 

(3) if 5 contains Tk+i then breadth(<i>*) > fc. 

Part (1) is obvious. For (2) note that breadth($*) > fc iff there exist elements xi, . . . ,Xk 
of X such that Cljelk] ^ ^-i^d fo'' ^ach i e [fc], 

jm\{i} 

and for such choice of Xi, setting X' — {xi, . . . ,Xk} we have X' \ {xi} E S O X' for 
each i. Similarly, for (3), if X' — {xi, . . . , Xk+i} G (fe+i) such that X' \ {xi} E S H X' 
for each i E [fc + 1] , then for each such i we have 

(y\a>:jn fj K^^k, 

jelk+i]\{i} 

in particular, taking « = fc + 1 we see that (^^^[fe] and for each i e [fc] we have 

{Y \ $* J n nje[fe]\{j} ^ 0> hence breadth($*) > fc. Also note that (2) and (3) are 
true with Tfe, Tk+i and $* replaced by 5^, Sk+i and ^$*, respectively. 

Suppose now that vc($) > 0, i.e., S is infinite. Then S* = Sg,* is also infinite (see 
Example l2.6p . Then we have vc(iS*) > 1, hence there are arbitrarily large n and B e (^) 
such that |iS* ni3| > n^^^. In particular, for each N there is a finite subset of Y with 
|iS* n Bn\ > N. Now suppose <1> is stable; then <i>* is also stable. Let fco > 2 be larger 
than the ladder dimension of Then if fc > 2 and N > N{ko, fc, fc) then S* n Bn (and 
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hence S*) contains the /c-star or the fc-costar. Thus by observation (3) above, at least 
one of $ or ^$ has infinite breadth. □ 

Of course, the converse of the imphcation in this proposition also holds: if vc(<i>) = 
then iS<i) is finite, hence trivially $ is stable, and both $ and ^$ have finite breadth, 
since 5^$ is finite as well. 

Example. Let < be a linear ordering of X. Suppose B is the collection of initial segments 
of {X, <), as in Exaniple l2.11l Then — {X \B : B G B} consists of final segments of 
{X, <). Hence B and -^B both have finite breadth (indeed, breadth 1). Proposition [2201 
shows that phenomena such as these are confined to unstable contexts (for infinite set 
systems). 

Using Lemma [2. 191 Proposition 12.201 also implies: 

Corollary 2.21. Suppose X and Y are infinite sets and $ C X x y such that 5$ is an 
infinite sublattice of {2-^ ,r\,U) of finite breadth, with 0,X ^ 5$. Then $ is unstable. 

3. The Model-Theoretic Context 

Throughout this section we fix a first-order language £, and we let ip{x; y) be a parti- 
tioned £- formula (as defined in the introduction), with object variables x = [xi, . . . , x^^) 
and parameter variables y = (j/i, . . . , ?/„). The formula Lp gives rise, in a given C- 
structure, to a set system. The associated parameters introduced in the previous sec- 
tion (shatter function, VC density etc.) are elementary invariants of the structure in 
question. In Section [3T2] below we also introduce the VC density function vc-^ of a com- 
plete first-order £-theory T with no finite models: if vc^(n) is finite then vc^^n) is a 
uniform bound on the VC density of all partitioned formulas in T having n parameter 
variables. In Section 13.31 we illustrate this concept by computing vc^(l) for various T. 
In Section [3^ we then extend the definition of dual VC density to finite sets of formulas; 
this is convenient for later sections, but, as we see in Section 13.51 does not add much 
extra generality. There is some indication that computing VC density is easier when 
only parameters coming from initial segments of indiscernible sequences are considered; 
although we will not pursue these issues in the rest of the paper, we think that the 
relationship between quantities like VC or vc and their "indiscernible" counterparts 
deserves further investigation; we explore some connections in the last subsection. 

3.1. VC density of definable families. Given an /^-structure M and a tuple b G M", 
we denote the subset of M™ defined by the £-formula ip{x; b) with parameters b in M 

by 

^^(M™; b) := {a G Af™ : M h ^{a; b)}. 

A subset of Af™ is called definable (in M) if it is of the form ip^ {M™; b), for some cp 
and b. We also denote by 

:= {^^(Af™;6) : & e Af"} 

the family of subsets of defined by (p in M, and we call [M"^ ,S^) the set system 
associated with (f in M. More generally, to a given collection $(a;) = {Lpi{x]yi)}i^i 
of partitioned /^-formulas in the tuple of object variables x (and in various tuples of 
parameter variables yi) we may associate the set system 
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on M™ defined by the instances of tfie formulas ipi. If tlic /^-structure M is understood 
from the context, we drop the superscript M in our notation. 

Suppose now M is an infinite /^-structure. As usual, say that Lp is invariant under 
an extension M C AT of /I-structures if M |= (f{a\ b) <^=4> TV \= (p{a; b) for ah a e M™ 
and b e M". The following is obvious: 

Lemma 3.1. Suppose N is an C-structure with M C N and Lp is invariant under 
M C TV. Then C M™ n , hence tt^ < and therefore YC{S^) < VC(5^) 
andYc{S^)<YciS^). 

For each s,i G N, consider the ^-sentence 
:=Vx(i)---VxWV2/W...Vy(^+i) 

\l<i<j<tk=l l<k<l<s+l l<i<t J 

where x^^^ = {xi \ . . . , Xm ) and y^^^ — {y[^^ ,yn^) are tuples of new variables. Then, 
with TT^ TTgM denoting the shatter function of , we obviously have: 

Lemma 3.2. For each s,t £N, 

In particular, if N is an C-structure with M = TV, then tt^ — tt^ . 

From now until the end of this section we fix a complete £-theory T with only infinite 
models, and let M range over models of T. By the previous lemma we may set 

7r^:=<, VCM:=VC(5^), vc^ := vc(5^), 

where M is an arbitrarily chosen model of T. We call tTi^ the shatter function of ip 
(in T), and we call YC{ip) and vc((/3) the VC dimension of ip and VC density of ip 
(in T), respectively. If we want to stress the dependence of tt^ on T we write ttJ, and 
similarly for VC and vc. 

Note that the definition of only depends on the set system S^p and not on the par- 
ticular representing formula ip. In particular, t:^ remains unchanged under 0-definable 
reparameterizations: 

Lemma 3.3. Let j{z;y) be an C-formula, where z — {zi, . . . ,zi), which defines the 
graph of a map g: A'P — )■ Af". Let ^{x; z) :— 3y{'y{z; y) A 93(2;; y)), so 

5. = {^^(M™;g(c)):ceM'}. 

Then tTq- < tTj^, with equality if g is surjective. 

The dual of the partitioned £-formula ip{x; y) is ip*{y] x) :— ip(x] y); that is, 93* (y; x) 
is syntactically the same >C-formula (/?, only with the role of the object and parameter 
variables interchanged. We call YC*{ip) := YC{ip*) and vc*{ip) := vc(</J*) the dual VC 
dimension and dual VC density of ip, respectively. By Lemma 12.41 we have tt* — t:^* 
and hence YG*{ip) = IN (if) and 

vc*{ip) = inf {r e R>° : TT*^{n) ^ 0{n'')}. 

If any of the quantities VC(<p), vc{ip), YC*{ip), vc*{ip) is finite, then so are all the others, 
and in this case we say that ip is dependent or that ip defines a VC class. Note that 
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for every partitioned £-forniula ip{x; y) we have 'vc{ip) > 0, with equality if Sip is finite. 
If Sip is infinite then yc{ip) > 1. (See the remarks following Lemma [2.21 ) 

Letting $ := ip{M"';M'^) and X := M", Y A/", in the notation introduced in 
the previous subsection we have 5$ = S^p and iS** = Hence Lemma 12.71 vields : 

Corollary 3.4. We have vc(-'i^) = Yc{ip), and if Tp{x;z) is another partitioned C- 
formula, then vc{ip A ■(/'), vc(</7 V "0) ^ vc(i^) + Yc{ip). 

From Lemma 12.21 one also obtains the invariance of vc under inverse images of sur- 
jective 0-definable maps: 

Corollary 3.5. Let 6{v]x) be an C-formula, where v = (vi, . . . ,Wfe), which defines the 
graph of a map f : M™, and let p{v;y) := 3x{6 A ip), so Sp = f^^{Sip). Then 

T^p^T^ip, with equality if f is surjective. 

The theory T is NIP iff every partitioned £-formula defines a VC class. The theorem 
of Shelah |86| already mentioned in the introduction shows that in order for every 
partitioned £-formula (p{x; y) to define a VC class, it is enough that this holds for all 
such tp{x\ y) with a single parameter variable (i.e., \y\ — 1). Hence if for each partitioned 
^-formula (f{x;y) with — 1 the set system S^p has finite breadth then T is NIP, by 
Lemma [2791 The theory T is said to be stable if for every partitioned £-formula <f{x; y) 
the associated relation $ = (p{M"^; M") is stable (in the sense of Section [2.3|) : if T 
is stable then for each ip{x; y) with S^p infinite, at least one of S,p or S^,p has infinite 
breadth, by Proposition l2.20l (Corollarv l2.21l of the same proposition also yields that if 
T is stable then all finite-breadth sublattices S of the lattice of all subsets of 7\f which 
have the form S — S,p for some >C-formula f{x; y) with \x\ = m are finite.) 

3.2. VC density of a theory. We define the VC density of T to be the function 

vc = vc^ : N ^ U {oo} 

given by 

vc(n) :— sup | Yc{tf) : f{x\ y) is an £-formula with \y\ = n}. 

Note that we could have also defined vc"^ as 

vc(m) — sup I vc*{lp) : (p{x; y) is an /^-formula with = m}. 

In the introduction we already observed that vc(m) > m for every m. If C is an 
expansion of £ and T' D T a complete £'-theory, then vc^ < vc^ , with equality if T' 
is an expansion of T by definitions. Moreover, vc does not change under expansions by 
constants: 

Lemma 3.6. Let C — CU {ci : i £ 1} where the Ci are new constant symbols, and let 
T' T be a complete C -theory. Then vc^ = vc^ . 

Proof Let M' |= T' and C :== {cf^' : i G /} C M' . Let ip{x;y,z) be an ^-formula 
with |a;| = to, and let c G C'l'^L Then 7r*(^.^^^^(i) < 7r*^^.^^^)(i) for every t, hence 

vc*{cp{x;y,c)) < vc*{ip{x;y, z)) < vc'"{m) and thus vc^ (m) < vc^(to). □ 

It is clear that vc{n) < vc(n + 1) for every n, by viewing a formula with n parameter 
variables as one with n + 1 parameters; perhaps less obviously: 

Lemma 3.7. vc(rt) + 1 < vc(rt + 1) for every n. 
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Proof. By the preceding lemma we may assume that C contains a constant symbol 0. 
Let ip{x;y) be a partitioned £-formula with \x\ = m, \y\ — n. We construct a formula 
ipix, Xm+i, y, Vn+i) with Tr^{t) ■ t < TT^{2t) for every t (hence vc((y9) + 1 < vc('0)), which 
then shows the lemma. We set 

:= {xm+i = A ip{x; y)) V {xm+i = Vn+i)- 

Then for b € M", c e M we have 

V'(Af"'+i; b, c) = MM"'; b) x {0}) U (M" x {c}). 

Let A C Af™ with \A\ = t and 7r^(t) = |A n Choose pairwise distinct elements 
ai, . . . , Ot e Af \ {0} and an arbitrary element a' e Af", and set 

A' := {Ax{0})u{{a',ai),...,{a',at)}. 

Then = 2t, and for 6 e A/" and j = 1, . . . , t we have 

A'n V(M'"+i;6,aj) = ((A n ^^(Af™; 6)) x {0}) U {(a', a^)}. 

Take 6i, . . . , 6^ € Af", fc = 7r^(i), such that the sets A n (p(Af™; 6^), i = 1, . . . , fc, are 
pairwise distinct. Then the sets A' n ip{M"'^^; hi, aj) (where i = I, . . . , k, j = 1, . . . ,t) 
are also pairwise distinct. Hence 7r^(2i) > \A' D > k ■ t = TTip{t) ■ t a.s claimed. □ 

In this paper we prove, for many (unstable) NIP theories T of interest, that vc^(to) < 
00 for every m, and in fact, in these cases we establish that vc'^(m) is bounded by a 
linear function of m. Note, however, that T NIP does not imply that vc"^(m) < oo for 
all m: it is easy to see that for every T (whether NIP or not) we have vc^ (1) = oo, 
whereas T is NIP iff T®'^ is NIP. (We thank Martin Ziegler for pointing this out.) 

By Laskowski's proof [55l of Shelah's theorem f86^, the VC dimension VC((p) of an 
iZ-formula ip{x; y) is bounded in terms of the VC dimensions of certain >C-formulas with a 
single parameter variable (which, however, are astronomical, involving iterated Ramsey 
numbers). This together with the examples below raises the following question, the 
answer to which we don't know: 

Question. If vc"^(l) < oo, is vc^{m) < oo for every m? 

Provided the answer to this question is positive, one may then also ask how vc(to) 
depends on m and vc(l); e.g.: is there a function (3: N x R-'^ — >■ R-°, independent of 
T, with the property that if vc^ {1) < oo, then vc^(to) < /3(m, vc^(l)) for every m? (In 
all examples which we considered where vc^ is known to be real-valued, vc^ grows at 
worst linearly.) 

3.3. Computing vc^(l). In concrete cases it is often easy to see that vc^(l) = 1: 

Example 3.8. Suppose that M is strongly minimal. The collection B = (*^) of one- 
element subsets of M has breadth 1; so vc^(l) = 1. (Corollary 12. 101 ) 

Example 3.9. Suppose that C contains a binary relation symbol "<", M = (Af, <,...) 
is an expansion of a linearly ordered set (Af, <), and T = Th(Af) is weakly o-minimal. 
Then for every partitioned £-formula ^p{x; y) with |x| — 1 there exists an integer N > 
such that for every b e Af™, the set ip-'^{M;b) is a finite union of at most N convex 
subsets of Af. Hence vc"^(l) = 1 by Example 12.111 
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Example 3.10. Suppose that £div is the expansion of the language {0, 1, +, — , x} of rings 
by a binary relation symbol "|" . In a field K equipped with a valuation v: K ru{cxD}, 
we interpret | by putting a\h v{a) < v{b), for all a^b € K. Suppose T is a complete 
theory of valued fields in an expansion of >Cdiv, and T is C-minimal, i.e., for every 
K 1= T, every definable subset of X is a finite Boolean combination of balls in K. Then 
for every partitioned >C-formula ip{x;y) with |a::| = 1 there exists an integer TV > such 
that for every b e K™, the set (p^ {K; b) is a Boolean combination of at most N balls 
in K. Thus vc^(l) = 1 by Example 

The definition of C-minimality used in the previous example agrees (for expansions 
of valued fields) with the one in |44j : this definition is slightly more restrictive than 
the original one, introduced in [38l [62] ■ Every completion of the £div-theory ACVF of 
non-trivially valued algebraically closed fields is C-minimal (essentially by A. Robinson's 
quantifier elimination in ACVF; see [43] ) . Conversely, every valued field with C-minimal 
elementary theory is algebraically closed [38]. Moreover, the rigid analytic expansions 
of ACVF introduced by Lipshitz [57] are C-minimal [58] . 

Example 3.11. Let i? be a ring and suppose C = Cji is the language of i?- modules. (In 
this paper, "i?-module" always means "left i?-module.") Suppose M is an i?-module, 
construed as an £i{-structure in the natural way. By the Baur-Monk Theorem, every 
£/{-formula is equivalent in T = Th(i\f ) to a Boolean combination of positive primitive 
(p.p.) £fl-formulas; given a p.p. £fl-formula (p{x\y) and b £ M^y\ the set ip[M^^^]b) is 
a coset of (/j(MI^I;0). Supp ose M is p.p.-uniserial, i.e., the subgroups of M definable 
by p.p. £/j-formulas form a chain. By Example 12.141 if M is infinite, then we have 
vc"^(l) — 1. (In [B^ this will be extended to vc"^(r7i) — m for every to.) Examples for 

p.p.-uniserial abelian groups (viewed as Z-modules) include Q*-"-*, ^[pp Z(p")'^") and 

Z(p°°)("), where p is a prime and a is a cardinal, possibly infinite. Here 

Z(p) = {a/fe:a, 6eZ, b ^ 0, p] b}, 

viewed as a subgroup of the additive group of Q, Z(p") denotes the cyclic group Z/p"Z 
of order p", and Z(p°°) denotes the Priifer p-group (the group of p"th roots of unity, 
for varying n, written additively). Given an i?- module AI and an index set /, M^^-* 
denotes, as usual, the i?-submodule of the direct product consisting of all sequences 
with cofinitely many zero entries. 

Examples I3.8II3TT] mav be generalized as follows: 

Example 3.12. A family $(x) — {ipi{x;yi)}ii=i of ^-formulas in the object variables x 
(and in various tuples of parameter variables t/i) is said to have dual VC dimension d 
if the set system 5 = 5$ defined by the instances of the formulas (pi has dual VC di- 
mension d. If $ has dual VC dimension at most 1, then we say that $ is VC-minimal; 
cf. Example 12.31 We also say that $ is directed if S is directed in the sense of Exam- 
ple [US 

The £-theory T is VC-minimal if there is a VC-minimal family of >C-forniulas $(x) 
with I a; I = 1 such that in every M \= T every definable (possibly with parameters) 
subset of M is a Boolean combination of finitely many sets in 5$. (This definition was 
introduced in [2].) If T is a VC-minimal £-theory, then for every £-formula ip{x;y) 
with \x\ = 1 there exists some N G N such that in every M \= T every instance (p{x; b) 
(6 G M'^I) of ip defines a subset of M which is a Boolean combination of at most TV sets 
in 5$, by compactness. 
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One says that the VC-minimal theory T is directed if one can additionally choose $(a;) 
to be directed; in that case we have vc"^(l) = 1 by Example 12. 131 By [21 Proposition 6], 
if <I>(a;) is VC-minimal and 5$ contains some 0-definable set other than or M'^', then 
there is a directed set ^(x) of £-formulas such that 5$ — and S^^ — S-,^. By 
Lemma this yields in fact vc^(l) = 1 for every complete VC-minimal T (directed or 
not) without finite models. 

Example 13.111 can also be generalized in a different direction: 

Example 3.13. Suppose £ is a language expanding the language {1, •} of groups, and T is 
a complete £-theory containing the theory of infinite groups. Suppose for every G \= T, 
every definable subset of G is a Boolean combination of cosets of acl'''^(0)-definable 
subgroups of G. (This condition holds, in particular, if T satisfies the model-theoretic 
condition known as 1-basedness, cf. [45].) By Example l2.141 if the collection of acl°''(0)- 
definable subgroups of G has breadth at most d (in particular, by Example I2.16| if it 
has height at most d), then we have vc^(l) < d. 

Here is a particular instantiation of the previous example: 

Example 3.14. Let i? be a ring, AI an i?-module, and T ~ Th(A/) in the language Cu, 
as in Example IXTTl We have M^" = Af(^o) (sec, e.g., [H Lemma A.1.6] or [SI 
Corollary 2.24]). Set := Th{M^°) = Th(Af(^«)). It is well-known that T = 
iS the class of models of T is closed under direct products, iff for all p.p. £_R-formulas 
</j(x), ^{x), either (p(Ml^l) C ■(/^(Afl^l) or the index 

Inv(M,^,V) := [v?(mI^I) : (^ A V)(M''^')] 

is infinite. (See, e.g., 021 Lemma A.1.7].) So if T = and the Morley rank MR(T) 
of T is finite then the length n of every sequence 

M D (pi{M) 2 ipi{M) n f2iM) 3 • • • 2 ipi{M) n • • • n (pniM), 

where each ipi{x) is a p.p. £/f-formula with \x\ = 1, is bounded by d = MR(T); so 
by Examples 12.141 and 12.161 we see that vc"^(l) < d. (Note that this bound is far from 
optimal: e.g., for R = Z, M = Z(/)(^°) we have MR(T) = d, yet vc'^(l) = 1 by 
Example 13. Ill ) In [5] we will extend this to vc-^(m) < md for every m. 

3.4. Dual VC density of sets of formulas. It is convenient to extend the definition of 
dual VC density to finite sets of formulas. Let A — A{x; y) be a finite set of partitioned 
/^-formulas ip — (p{x] y) with the object variables x and parameter variables y. We set 
:= {^ip -.ipeA}, and for B C M^y^ we let 

A(a;; B) := {ip{x; b) : (p e A, b e B} . 

Given a finite set B C M'^' , we call a consistent subset of A(a;; i?)U^A(a;; B) a A(x; B)- 
type. Note that our parameter sets are subsets of M'^' , and not of M, as is more common 
in model theory. (This is simply a matter of convenience, in order to be compatible with 
VC duality.) Given a A(x; i3)-type p we denote by p^ C M'^' its set of realizations 
in M. Since we are only dealing with finite sets A and finite parameter sets B C Af'^', 
all A[x; i3)-types have realizations in M itself (rather than in an elementary extension). 
Given another finite set A'(a;; y') of partitioned ^-formulas and a finite B' C Af'^ we 
say that a A{x; i?)-type p is equivalent to a A'(x; i?')-type q if p^ — . 



VC DENSITY IN SOME NIP THEORIES, I 



21 



Let now B C Apy' be finite. Given a e M'^' we denote the A{x; i3)-type of a by 

tp^{a/B) :={ ip{x; b):beB, ip e A, M \= ip{a; b)} U 

{^(p{x; b) -.be B, ip£ A, M ^ ip{a; b)}. 

We write S'^{B) for the set of complete A(a;; i?)-types (in M), that is, the set of (in M) 
maximally consistent subsets of A(a;; B) U ^A(x; B); equivalently, 

S'^iB) ^{tp^{a/B) : ae Afl^l}. 

If A = {ip} is a singleton, we also write S'^{B) instead of S^{B). The elements of 
S^{B) are syntactical objects (sets of formulas), but associating to a type p £ S^{B) 
its set p-'^ of realizations in M gives a bijection from S^{B) onto the set 

S{ip^iA&\;b):beB,ipeA) 

of atoms of the Boolean algebra generated by the subsets ip^ {M^^^;b) of Af'^L (See 
Section [2.20 Hence for every partitioned £-formula ip{x;y) we have 

7r;(i) =max{|S"^(B)| : B C m'^I, |S| = t}. 

In the general case, for every i e N we also set 

TTlit) max {15^(5)1 -.BCM^, \B\^t}, 

so < 7r^(i) < 21"^ I*. Similarly as in Lemma |3^ one shows that if we pass from M to an 
elementarily equivalent £-structure then tt^ does not change (justifying our notation, 
which suppresses M). 

Let Ao(x; y) be a finite set of partitioned >C-formulas with Aq C A, and B C A/I^'l be 
finite. Then there is a natural restriction map S'^{B) — > S'^''{B), written as p p\ Aq. 
This map is onto: given p £ S^°{B) let a £ p^ be arbitrary; then q :— tp'^{a/B) £ 
S^{B) satisfies q \ Aq ^ p. In particular, \S^°iB)\ < \S^{B)\. Note also that if 
A 7^ 0, then the restriction maps p t-^ p\ ip, where p £ A, combine to an injective map 
S^{B) ^ U^^AS'^iB); in particular, |5^(B)| < H^eAl'^'^C^)!- This shows: 

Lemma 3.15. If all p £ A are dependent, then there exists a real number r with 
0<r < E,^GA vc*(</?) and 

15-^(5)1 = OdSr) for all finite B C M^y^ . (3.1) 

We define the dual VC density of A as the infimum vc*(A) of all real numbers r > 
such that p.ip holds; that is, 

vc*(A) = inf {r > : 7r^(i) = C'(f' )}. 

We have 

maxvc*(</?) < vc*(A) < YC*{p). 

'^^ i^eA 
Clearly vc*(A) agrees with vc*{p) as defined previously if A = {p} is a singleton. 
Moreover, vc*(A) = iSvc*{ip) ~ for every ip £ A, and if vc*(A) < 1 then vc*(A) = 0. 
(See the remarks following Lemma [2.21 ) Note that in computing vc*(A) there is no 
harm in assuming that A is closed under negation, i.e., with every Lp £ A the set A 
also contains a formula equivalent (in M) to ^p. (Passing from A to A U ^A does not 
change S^[B).) 

Example. Suppose A{x;y) = {xi =?/,... ,Xm = y} where \x\ = m and \y\ — 1. Then 
for finite B C M we have \S^iB)\ = {\B\ + hence vc*(A) = J2u,ga vc*('^) = m. 
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Wc finish this subsection with an easy result about interpretations (related to Lem- 
ma [331 and CoroUarv lS.Sp . 

Lemma 3.16. Let M' be an infinite structure in a language CJ and tt: X — > M' an 

interpretation of M' in M without parameters, where X C is (b- definable. Then 
for any finite set A'{x;y) of C -formulas there exists a finite set A{x;y) of C-formulas 
such that |A| = |A'|, \x\ = r\x\, and tt^, < tt^^. 

Proof Let m \x\ and n := \y\. Let B' C (A/')" be finite. Choose B C X" with 
\B\ = \B'\ such that each b = e B' has the form (7r(6i), . . . , 7r(&„)) for 

some (61, . . . ,bn) G B. For each £'-formula 1^9(2;; y) choose an £-formula ilj^{x;y), where 
X = {xi, . . .,Xm), y = iVi, ■ ■ ■ ,y„) and \xi\ = • • • = JS™| == lyj = • • • = |y„| = r, such 
that Vv(Af ("+")'■) C and for any ai, . . . , a™, 61, . . . 6„ € X, 

M \= V'^(ai, . . .a„;5i, . . . ,6„) -^=^ M' |= (^(7r(ai), . . . , 7r(a„i); 7r(6i), . . . , 7r(6„)) . 

Let a finite set A'(a;; y) of ^'-formulas be given. Set A := {-0^ : f e A'}. Then S'^{B) C 
X™, and (ai, . . .a™) (7r(ai), . . . ,7r(am)) yields a surjective map S^{B) {B'), 
hence |S"^'(B')| < \S'^{B)\ as required. □ 

By Lemmas 13.61 and 13.161 

Corollary 3.17. Let M' be an infinite structure in a language C , interpretable in M 
{possibly with parameters) on a definable subset of . Then, writing T = Th(A^) and 
T' — Th(7W'), we have vc^ (m) < vc^{rm) for every m. 

So for example if G is a group (considered as a structure in the usual first-order 
language of group theory) and if is a definable normal subgroup of G, then yc^^^^'^/^^ < 
.^(,Th(G) j£ jj j^g^g ijifinitc index in G, and vc"'"^'^^^ < vc"'"^'^'^^ if H is infinite. 

3.5. Coding finite sets of formulas. We let £, M and A be as in the previous 
subsection, and T = Th(7Vf). The following useful lemma, essentially due to Shelah 
[88l Lemma IL2.1], shows that counting A{x; i3)-types where |A| > 1 is not really more 
general than counting A(x; i?)-types where A is a singleton: 

Lemma 3.18. Let d = |A| and y' = (yi, . . . , i/2d, z,zi,..., Z2d) with \y\ = \yi\ = \zi\ = 
\z\ for every z = 1, . . . , 2d. There is an L-formula ipAix; y') with the following properties: 

(1) for every finite B C mI^^I with \B\ > 2 there is some B' C A/I?''! with \B'\ = 
2d\B\ such that every p G S^{B) is equivalent to some q € S^^{B'); 

(2) for every finite B' C MI^'I there is some B C M^y^ with \B\ < 2d\B'\ such that 
every q £ S'^^{B') is equivalent to some {possibly incomplete) A{x; B)-type po. 

In particular, we have 7r^(i) < 7r'^^{2dt) for t > 1 and 7rJ^(t) < ■n*^{2dt) for t > 0. 
Thus vc*(A) — vc*{ip/\) < vc^{m) where m — \x\. 

Proof. Write A = {(pi, . . . , ifd} and define ipA as follows: 
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For (1), suppose B C M'^' is finite, and 60 ^ 61 are distinct elements of B. For b & B 
and k € [d] set 

bi^^ '■= {bo, bo, b, feo, 61, bi, bo) 

yi 1/2 ■■■ Ud+k ■■■ V2d Z Zi ... Zd+k ■■■ Z2d 

and 

bf^ := {bo, bo, b, bo, bi, bo, bi, bo) 

yi 2/2 ■ ■ ■ Vk ■■ ■ y2d Z Zl ... Zk ... Z2d ' 

and put 

B' := {bi''\b['''> -.beB, k€[d]} C (mI«I)4'^+i. 
Then \B'\ = 2d\B\, and for every b e B, k e[d] we have 

Given p € S^{B) we set 

q ■= {^^A{x;b''Q ^), '4)\{x;bf^) : ipk{x;b) e p} U 
{ i^A{x;b^^'^),^iJA{x;bf'^) : ipk{x;b) ^ p}. 

Then clearly 9 e S'^^{B'), and g is equivalent to p. Themappi-^ g: S"^(S) S'^^{B') 
is injective, hence < |S"^^(B')| < 7r;^(2d|B|). 

For (2) note that if 61, ... , b2d, c, ci, . . . , C2d G M'^' then the formula 

V'A(a;;6i,...,&2d,c, ci,...,c2d) 

defines ipk{M^^^;bk), ^(pk{M^^^;bk+d), or (since the q's are not necessarily distinct). 
Let B' C Mi?''! be finite, and q e S^{B'). Set 

i? := {& e A/I^l ■■b = bi for some (61, . . . , &2d,c, ci, . . . , C2d) e B'} 

and let po be the set of formulas which have the form ipk{x;b) where k ^ [d], b = bk 
for some V'A(a;; &i, . . . , 62^, c, ci, . . . , C2ci) G 9 with c = c^, or the form -^(pk{x; b) with 
A; e [d], & = 6d+fc for some ipA{x;bi, . . . ,b2d,c,ci, . . . ,C2d) G 9 with c = Cfc+d. Then 
\B\ < 2d\B'\, and po is a A(a;; i3)-type equivalent to q. For each q choose an extension p 
oipo to a complete A(a:; i?)-type. Then the map p: S^'^ [B') S'^{B) is injective, 
so \S^^{B')\ < \S^{B)\ < 7r^(2d|B'|). □ 

In the rest of this subsection we give some applications of this lemma. We first note: 

Corollary 3.19. Let $ fee a set of C- formulas with the tuple of object variables x and 
varying parameter variables such that every C-formula ip{x; y) is equivalent in T to a 
Boolean combination of formulas in Then 

vc'" (m) = sup { vc*(A) : A C $ finite } where m — \x\. 

Proof. The inequality "<" is a consequence of the hypothesis: for each £-formula 1^9(2;; y) 
there is a finite subset A = A{x;y) of $ such that \Sf{B)\ < \S^{B)\ for each finite 
B C M'*''. The reverse inequality follows from the previous lemma. □ 

Let M* )^ M be a monster model of T. Consider the expansion of £ by a new 
predicate symbol R^^dx) for every £-formula ipix; z) and every c £ (M*)l^l. The Shelah 
expansion of M is the expansion of M to an £^''-structure M^^ where each predicate 
symbol R^^c(x) as before is interpreted by ilfl^l n {{M*)\='\;c). Shelah showed 
[59] (with another proof given in [20]) that if T is NIP then T^^ = Th{M^^) admits 
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quantifier elimination and is also NIP. This provides an interesting way of constructing 
new NIP theories from old ones. The previous lemma and its Corollarv 13.191 allows us 
to prove that T and T^*^ share the same VC density function: 

Corollary 3.20. vc^""' = vc^. 

Proof. Fix some m and assume |x| = m. The inequality vc''"^ (m) > vc"^(m) being 
obvious, we only need to show that vc^ (m) < vc^{m). Let A = A{x;y) be a finite 
set of atomic £^''-formulas; by Corollarv 13.191 and Shelah's theorem mentioned above, 
it suffices to show that vc*(A) < vc"^(to). Take a finite set vl> = vl'(x; y, z) of partitioned 
£-formulas and some c g M'^I such that A = {R^^c{x;y) : V e 5'}. Let B C M^y\ be 
finite, B* := B X {c}, and let p € S'^{B). Let a be an arbitrary realization of p (in 
M^h), and define p* tp*(a/B*) (in M*). Then for ?/; e * and b & B we have 

^p{x;b,c) e p* <^ M*|=i/'(a;&,c) 
^ MS'>i?^,,(a;6) 
<^=> R,p^c{x;b) e p. 

In particular, the map p ^ p* : S^{B) 5*(i?*) is injective, so vc*(A) < vc*(5') < 
vc"^(m) by Lemma [3. 181 □ 

It is well-known (see, e.g., |100| Theorem 4.7]) that the direct product of two NIP 
structures is again NIP. As a consequence of the last lemma we can also now estimate 
the VC density of a direct product in terms of the VC densities of its factors. We refer 
to [42l Section 9.1] for the definition of the product of two £-structures, and to [42l 
Corollary 9.6.4] for the Feferman-Vaught Theorem used in the proof below. 

Lemma 3.21. Let M' be another infinite C-structure, T' = Th(M'), and let T'^ = 
Th(M X M') be the L-theory of the direct product of M and M' . Then 

vc^ < vc^ + . 

Proof. Given n-tuples a = (ai, . . . , a„) e M" and a' = {a'l, . . . , a^) € (M')" we denote 
by a X a' the n-tuple ((ai, a'j^), . . . , (a„, a^)) of elements of M x M'; every element of 
(M X M')" has the form a x a' for some a G M", a' G (M')". 

Let (p{x; y) be an £-formula. By the Feferman-Vaught Theorem there exist finitely 
many pairs of ^-formulas {6i{x-y),6^{x;y)), i & [n] — {l,...,n}, such that for all 
a e mI^i, a' e (M')l'=l and b e M^y^, b' e (M')!"!, 

MxM' \^ ip{axa']bxb') ^ for some i e [n], M ^ 6*^(0; and M' |= 6i^(a'; 6')- 

Set e := {6*1, ... , On}, 9' {e[, 6*;}. Let C be a finite set of tuples from (M x 
M')\y\. Take B C mI«I, B' C (M')'"' with JB], \B'\ < \C\ such that each c G C is of 
the form c = 5 x 6' for a unique pair (6, b') G B x B' . For every p G S'^{C) choose a 
reahzation Op x G {M x M')^^^ of p in M x M', and put 

q:=tp'^{ap/B), q' -.^ tp'^' {a'^/ B'). 
Then for all (6, b') e B x B' we have 

tp{x;b X b') e p <^=^ M X M' ]= (^(flp X flp; 6 X 6') 

■^=^ M ]= ei{ap; b) and M' ]= 6'-(ap; 6'), for some i G [n] 
<^=> 6*^(2:; b) <E q and (x; 6') G g', for some i G [n]. 
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Hence the map p (-> {q,q') is an injection S'^{C) -> S^{B) x 5® {B'). In particular we 
obtain 7r*(i) < 7rQ(t) • 7^^,(1) for every t and hence vc*{ip) < vc*(8) + vc*(8'); here tt* 
is computed in M x M' and ttq, ttq, in Ai" and M' , respectively, and similarly for vc*. 
By Lemma 13.181 therefore vc^ (m) < vc'^(m) + vc"^ (m) where m = □ 

Remark. In a similar way one shows that if M' is a finite >C-structure and = 
Th(M X M'), then vc^'' < vc^. 

We finish this subsection by noting a further restriction on the growth of vc (cf . also 
Lemma [3?f|l : 

Lemma 3.22. dvc{m) < vc{dm) for all d,m > 0. 

Proof. Let A{x]y) be a finite set of >C-formulas with |x| = m. Let xi, ...,Xd be new 
m-tuples of variables and set 

/S'{xi,...,Xd\y) := {'^{xi;y) : Lp{x]y) G A, i = l,...,d}. 

Let B C M\y\, \B\ = t & n, such that r := 7r2,(t) = \S^{B)\. Let ai,...,a^ e 
M™ be realizations of the types in S^{B). For each i — (ii, . . . G [r]'^ let Oj := 
(flij , . . . , Ui^) £ (Af™)'^ = Af^™. Then the Oi realize pairwise distinct A'{xi, . . . , a;^; B)- 
types. This yields (7r^(i))'' = \S'^iB)\'^ < \S^' {B)\ < 7r^,(i)- Since t was arbitrary, we 
obtain dvc*{A) < vc*(A'). Hence dvc(m) < vc{dm) by Lemma [3. 181 □ 

3.6. VC density and indiscernible sequences. In this subsection we assume that 
M is sufficiently saturated. Recall that n^{t) is the maximum size of iS,^n A as A ranges 
over t-element subsets of Af™, and 7r*(i) is the maximum size of S'^{B) as B ranges 
over all ^-element subsets of Af"; here, as above m = \x\, n = \y\. These definitions 
may naturally be relativized to parameters coming from indiscernible sequences. More 
precisely: 

Definition 3.23. For every t let 'rT^p,ind{t) be the maximum of C) A\ ds. A ranges 
over all sets of the form A = {ao, . . . , at-i} for some indiscernible sequence (ai)^^^ in 
A/™, and let tt* be the maximum of where B = {69, ■ • ■ , for some 

indiscernible sequence (&i)igN in A/". We call 7r^,ind the indiscernible shatter function 
of and TT* jj^j the dual indiscernible shatter function of Lp. 

The indiscernible shatter functions give rise to corresponding notions of indiscernible 
VC dimension VCind(<p) and indiscernible VC density vcind(¥') of (and their duals 
VCfjjj(</j) and VC*^(j(</5)) in a natural way; for example, is the infimum of all 

r > having the property that there is some C > such that for all t and indiscernible 
sequences (6i)igN we have \S'^{B)\ < Ct^ , where B = {bo, . . . , &t-i}; if there is no such r 
then vcf^d((^) = ex.. 

As in the classical case (cf. Lemma 12. 4p we see that tt* j^^^j = 7r^-,ind and hence 
VCind(<P*) = VC*^(j((p) and vcind(¥'*) = '^c*„(j(iy9). Directly from the definition we have 
T^ip,ind — '^v and hence VCind(¥') < VC((/p) and vcind(<y5) < vc((y9). In particular VCind(</') 
and vcind(v') are finite if (p defines a VC class. Conversely, if VCind(</') is finite, then so 
is VC ((/?). (This follows by saturation of M and extraction of an indiscernible sequence; 
see proof of Proposition 4 in (5-) Hence if one of the quantities YG{ip), yc{ip), VCind(¥'), 
or vcind(</') is finite, then so are all the others. 

Another numerical parameter associated to Lp and defined via indiscernible sequences 
is the alternation number alt((/3) of Lp (in M). This is the largest d (if it exists) such 
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that for some indiscernible sequence (ai)igN in M"^ and some b £ Af" we have 

a, e (p{M"'; b) ^ a,+i ^ ^(M™; b) for alH < d - 1. 

If there is no such d we set alt(iy9) — oo. It is well-known (and essentially due to Poizat) 
that alt((/j) < 2VCind(</') + 1 (see, e.g., [3l Proposition 3]) and that if alt((p) is finite 
then ip defines a VC class jHl Proposition 4]. Moreover: 

Lemma 3.24. vci„d(<y5) < a\t{ip) — 1. 

Proof. Since this is trivial if ip has infinite alternation number, we assume that d := 
alt(<p) < CO. Let (ai)igN be an indiscernible sequence in M™ and A — {uq, . . . , at-i}. 
Then for each b € M", there are less than d indices i < t — 1 such that (p{ai;b) 
and (p{ai+i; b) have different truth value in M, and the set A n p{M"^] b) is uniquely 
determined by knowledge of these indices i. Thus |j4n5,^| < 2^^"^^ (*) = 0{t'^^^) and 
hence vCind(<y5) < d — 1 as required. □ 

Example. Suppose C (^^ ) where d > 0. Then alt(i^) < 2d + 1 and vciiid(<<2) < 
vc(iy9) < d, and all these inequalities are equalities if S^p = ) . 

The previous example shows that the inequality in Lemma 13.241 in general, is strict. 
The inequality VCind(v) < VC((/3) may be strict if there are no non-trivial indiscernible 
sequences: 

Example. Suppose C — {A, S, P} where A and S are unary relation symbols and P is 
a binary relation symbol, and suppose M is an /^-structure, with the interpretations of 
A, S and P in M denoted by the same symbols, such that 

(1) \A\ = d and \S\ = 2'*; 

(2) for s G S, P{x, s) defines a subset of A so that when s runs through S we obtain 
all subsets of A; 

(3) for s ^ S, P{x, s) defines the empty set. 

Then VC(F) = d and VCind(/') 1 (as well as vc(P) = vci„d(/') = 0). 

The inequality vcind(</') < vc((/3) may also be strict, as Lemma 14751 in the next section 
shows. We do not know the answer to the following question: 

Question. Is vcind('y3) always integral- valued? 

(After a first version of this manuscript had been completed, Guingona and Hill [35] 
showed that this question indeed has a positive answer.) 

We finish this section with a connection between vCj*,^^ and the Helly number. We 
already remarked (see Section [274| that if M — (M, <) is a dense linearly ordered set 
and p{x; yi, zi, ?/2, Z2) = (yi < x < zi V y2 < a; < Z2) then the set system S^p has infinite 
Helly number: that is, for each d there is a finite subfamily of which is d-consistent 
yet inconsistent. In contrast to this, we have: 

Lemma 3.25. Put d — L'^c*jjj((^)J -I- 1. Then for every indiscernible sequence [biju^n in 
Afl^l the set system S = {(p^M™", bi) : i € N} has Helly number at most d. 

Proof. Suppose for a contradiction that (6i)igN is an indiscernible sequence such that 
S — {p{M"''; bi) : i e N} has Helly number larger than d. Then some finite subfamily Sq 
of iS is d-consistent but not consistent. By indiscernibility of {bi), every finite subfamily 
of S of size at least |iSo| has this property. In particular, we can take D G N maximal 
such that the set {ip{M'^;bi) : i < D} is consistent. Obviously D > d. Since {bi) is 
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indiscernible, we obtain that for any /q G (^) the set {ip{M"^; bi) : i G Iq} is consistent, 
but for any D' > D and any Ii € (^,) the set {(y9(M™; bi) : i ^ Ii\ is inconsistent. Let 
f > _D be arbitrary, and set Bt = {bi : i < i\. For / e (^) let (7/(2:) be the unique 
</3-type over Bt with (/9(x; 6^) £ qi for i G / and ^<y5(.T; bi) G q/ for i ^ I. Since |/| = D 
every qi is consistent. Thus |S"''(i3t)| > (^) = ©(i^)- Since D > d, this contradicts 
<„d(¥') < □ 

Remark. Note that in the context of the previous lemma, we cannot achieve the stronger 
conclusion that S has breadth at most d: for the formula f{x] y) = x ^ y and any 
indiscernible sequence {bi), the set system S always has infinite breadth. 

By Lemma 13.251 and extraction of an indiscernible sequence (using that M is as- 
sumed to be sufficiently saturated) we obtain a consequence which does not mention 
indiscernibles: 

Corollary 3.26. Suppose the set system S^p is d-consistent, where d = lvc*{ip)\ + 1. 
Then there is an infinite subset of S^p which is consistent. 

This is a weak version of a theorem of Matousek [HT], according to which, if Sip is 
d-consistent, where d > vc*{(p), then one may write S^p = SiU - ■ - USn (for some TV G N) 
where each Si is consistent. 

4. Some VC Density Calculations 

In this section we give an example of a formula in the language of rings which, in 
every infinite field, defines a set system with fractional VC density, depending on the 
characteristic of the field. The construction of this formula (which is inspired by an 
example by Assouad [7], who in turn credits Frankl) proceeds in two steps: we first 
associate to a given partitioned formula ip a bigraph (= bipartite graph with a fixed 
ordering of the bipartition of the vertex set), and then we realize the set of edges of 
this bigraph as a definable family S;^. For our example we choose (p so as to encode 
point-line incidences in the affine plane; the calculation of vc(^) in characteristic zero 
uses an analogue of the Szemeredi- Trotter Theorem due to Toth. We also discuss 
the question whether VC density in NIP theories can take irrational values, and give 
examples of formulas in NIP theories whose shatter function is not asymptotic to a real 
power function. 

Throughout this section £ is a first-order language and M is an £-structure. 

4.1. Associating a bigraph to a partitioned formula. We follow [59j and make a 
distinction between bipartite graphs and bigraphs. A bipartite graph is a graph (V, E) 
whose set V of vertices can be partitioned into two classes such that all edges connect 
vertices in different classes. By a bigraph we mean a triple G = {X, Y, $) where X and Y 
are (not necessarily disjoint) sets and $ C AT x y. Thus a bipartite graph can be viewed 
as a bigraph if we fix a partition and specify which bipartition class is first and second. 
Conversely, if G = {X, Y, $) is a bigraph then we obtain a bipartite graph {V{G),E{G)) 
(the bipartite graph associated to G) by letting V{G) be the disjoint union of the sets 
X and Y, and E{G) = 3"; by abuse of language we call V{G) the set of vertices of 
G and E{G) the set of edges of G. We also say that G is a bigraph on V = V{G). 
(What we call a bigraph G — (A, Y, $) is sometimes called an incidence structure, and 
{V{G),E{G)) is called its Levi graph or incidence graph.) A bigraph is said to be finite 
if its set of vertices is finite. It is easy to see that a finite bigraph G can have at most 
i|y(G)|2 edges. 
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A bigraph G' = [X' ,Y' ,^') is a sub-bigraph of G = {X,Y,^) if X C X', y C Y', 
and <i>' C $. We say that a bigraph G contains a given bigraph G' (as a sub-bigraph) if 
G' is isomorphic to a sub-bigraph of G. Given a bigraph G = (X, Y, $) and a subset V 
of its vertex set V{G), we denote by 

Gfv^ := (xnF,y n v^,$n (V" X v)) 

the sub-bigraph of G induced on V . The complement of a bigraph G = {X, Y,^) is the 
bigraph ^G := (X, F, ^$), and its dual is G* := (Y, X, $*) where ^$ and $* are as in 
Section [231 

Let (p{x] y) be a partitioned >C-formula, where \x\ = m, jyl = n. We may associate a 
bigraph G^ = (X, F, $) to 9? and M, where X = M", F = M", and 

$ = ip{M"';Ar) = {(a, 6) G Af™ x Af" : M ^ ^(a;&)}- 

Note that G^^p — -^G^ and G^- — {G^)* . If we want to stress the dependence of G^p on 
M , then we write G^ instead of Gip. If ip is invariant under the extension M C AT of 
£-structures, then G^ \V = G^ where V ^ V{G^). 

From now on until the end of this subsection we assume that M is infinite and m = n. 
The collection 

E[G^) ^ {{a,b) : (a, 6) e (^(Af"; A//")} C A//" x Af" 
of edges of G^ then maps naturally onto the definable family 

= {{a,b} : (a, 6) € ^(Af"; Af")} C 

of subsets of Af" by a map whose fibers have at most 2 elements; here 0{v; x, y) is the 
partitioned >C-formula with object variables v = (vi, . . . ,Vm,) and parameter variables 
{x, y) given by 

(fiv; X, y) := if{x; y) A {v = x V v = y). 

Note that VC(^) < 2. Also, — and hence (f* and cp have the same VC dimension 
and VC density. A bound on the number of subsets of a given finite set which are cut 
out by 5^ may be computed as follows: 

Lemma 4.1. Let A C Af™ be finite. Then 

\Ao\ + l\E{G^ \V)\ < |An5^| < 1 + |Ao| + \E{G^ \V)\ 

where 

(1) Aq is the set of all a d A such that M |= 1^9(0; b) or M |= ip{b; a) for some 
b e M"\ but there is no b £ A with M \= Lp{a; b) or M |= (p{b; a), and 

(2) V C V{G^p) is the disjoint union of A considered as a subset of X and A 
considered as a subset ofY. 

Proof. Each set S E A n 5^ is of one of the following types: S — (d; S — {a} where 
a € ^o; or S" = {a, b} where a,b £ A with M |= (p{a; b) or M \— (p(b; a). Each set of the 
last two types actually occurs in A n 5^, whereas S — ^ only occurs iff there is some 
edge (a, b) of G^ with a,b ^ A. □ 

Hence if we set 

n^{t) ■.= m&x{\E{G^ \V)\ -.VCViG^), \V\ = t} G N, 
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then the lemma shows that 

in^(t) < 7r^(i) < l + t + n^(2t) for every t. (4.1) 

This observation opens up a road to computing (upper or lower) bounds on the VC 
density of the formula (p: find a bound on the number of edges of the subgraph of G^p 
induced on finite subsets of its vertex set, in terms of the number of vertices. In the 
following we give some applications of this approach. 

For positive integers r and s we denote by Kr.s '■= (H, [s], [r] x [s]) the complete 
bigraph with the vertex set [r] U [s]. The following is a fundamental fact about finite 
bigraphs: 

Theorem 4.2 (Kovari, Sos and Turan [51]). Let r < s be positive integers. There exists 
a real number C — C'{r, s) such that every finite bigraph G which does not contain Kr.s 
as a sub-bigraph has at most C \V{G)\'^~^^^ edges. 

(In fact, a more precise bound is also available, in terms of the sizes of the vertex 
sets X and Y, but we won't need this.) 

Corollary 4.3. Let r < s be positive integers. There is a real number Ci ~ Ci(r, s) 
with the following property: if ip{x; y) is an C-formula such that G^p does not contain 
Kr^B IS a subgraph, then 7r^(i) < Ci t'^~^^^ for every t; in particular, vc{(p) < 2 — 

Proof. If F C V{Gip) is finite, and the bigraph G^p \ V does not contain Kr^s, then 
\V)\< C by TheoremHSl where G = C(r, s) > is as in that theorem. 

Thus TTfi{t) <l + t + n^(2t) < 2(1 + 2i-i/'-C) t^-iA by □ 

Given integers r, s > 1, the bigraph contains Kr^s if and only if there are pairwise 
distinct ai, . . . , € M™ and pairwise distinct bi, . . . ,bs € M™ such that M \= ip{ai; bj) 
for all is [r], j G [s]. It is interesting to note that if does not contain Kr,s as a 
sub-bigraph, for some r,s > 1, then the bigraph G^p associated to -^(p does contain 
Kt^t, for every t > 1: by an analogue of Ramsey's Theorem for bigraphs due to Erdos 
and Rado [32], for every t there exists an n such that for all bigraphs G with |V^(G)| > n, 
one of G, ^G contains Kt^t as a sub-bigraph. Hence in this case the VC density of the 
formula ^ associated to ^ip equals 2, by (|4.ip . 

4.2. Point-line incidences. Let K be an infinite field, construed as a first-order struc- 
ture in the language of rings as usual. The partitioned formula 

ip{xi,X2;yi,y2) X2 = yixi + y2 
gives rise to the bigraph G^p = {X, Y, $) where X = Y = and 

We may think of V{G^) = X\JY as the disjoint union of the set X of points p — {r/, ^) G 
in the affine plane A'^{K) over K and the set Y of non- vertical lines £ in A'^{K); 
thus E{Gip) is the set of point-line incidences {p,£) where p £ A'^{K) and £ C A'^{K) is 
a non-vertical line containing p. The bigraph G^p does not contain K2^2 as a subgraph. 
(Two distinct points in A'^{K) lie on a unique line.) Hence by Corollary 14.31 

Corollary 4.4. There is a real number Gi > (independent of K) such that 'Kp{t) < 
Gi ^3/2 for every t; in particular, vc{(p) < ^. 
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Note that this bound is better than what we get from the general estimate vc < VC, 
since VC(<^) = 2. Also, if if = M, then for om' original formula Lp we have 7r^(i) = 
1 + t + (*) for every t. In particular VC((^) — vc((^) — 2. 

A lower bound on vc((^) is given by: 
Lemma 4.5. Suppose K has characteristic 0. Then vc((^) > |. 

Proof. This is due to Erdos, with the following simpler argument by Elekes |31j : let k 
be a positive integer, t = Ak^ , and consider the subsets 

{(77,C) :?y==0, l,...,fc-l,e = 0,l,...,4fc2-l} 

L {(a, 6) : a = 0, 1, . . . , 2fc - 1, 6 = 0, 1, . . . , 2fc2 - l} 

of Z^, and set V PUL C V{G^p). Then for each i = 0,1, . . . , k—l, each line rj — a£, + b 
with (a, b) Cz L contains a point (?], ^) G P with ^ = i, so 

|i?(G^ > fc • |i| = 4fc4 = p3_t4/3 ^ 1 |y|4/3 

and hence vc{ip) > | by (|iT|) . □ 
The precise value of vc(^) depends on the characteristic of K: 

Proposition 4.6. 

(1) Suppose K has characteristic 0. Then vc(^) — |. 

(2) Suppose K has positive characteristic. Then yc{ip) — |. 

In the proof of this proposition we use the following generalization of a famous theo- 
rem of Szemeredi and Trotter [96] (although a weaker version of this theorem from [93] , 
with a somewhat simpler proof, would also suffice for our purposes): 

Theorem 4.7 (Toth [97]). There exists a real number C such that for all m,n > 
there are at most C{m?/^n'^/^ + m + n) incidences among m points and n lines in the 
affine plane over C. 

Proof of Proposition \4.6\ The lower bound vc{(p) > | in (1) was shown in the previous 
lemma. From Theorem 14.71 and Lemma 14.11 we obtain vc^(^) < |. E K is any field of 
characteristic with algebraic closure K^^^, then tt~ < n~ = ttS by Lemmas 13. II and 
13. 2[ showing part (1) of Proposition 14.61 

The upper bound vc{(f) < | in (2) is a consequence of CoroUarv 14.41 For the lower 
bound we use the following observation: if F is a finite subfield of K, say — q, then 
\V{G^)\ = 2q^ and \E{G^)\ = q\ hence 

\EiG!^ \V)\ = \EiG^)\ = -^\V\'^' where V = V{G^). 

Together with (14.11) this yields the inequality vc(^) > | in (2). □ 

Proposition 14.61 shows in particular that there is no hope for a "Los Theorem" for 
VC density: if M is a non-principal ultraproduct of a family [Miji^i of infinite C- 
structures, then one may have vc'^ {ip) ^ vc^^{lp) for all i S /. 

It is interesting to contrast Proposition 14.61 with the outcome of only considering 
parameters from an indiscernible sequence: 

Lemma 4.8. The formula (p has alternation number 2, hence vcind(<?) = 1- 
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Proof. It suffices to show alt((^) = 2, since then Lemma 13.241 yields vcind(<?) = 1- 
Suppose for a contradiction that (ai)igN is an indiscernible sequence in and b = 
(p, G K"^ X witnessing that alt(^) > 3. We think of the elements of K"^ both as 
points in the affine space A'^{K) over K and as non- vertical lines in A'^{K), and let i, j 
range over {0, 1, 2, 3}. The are pairwise distinct, p € £, and = p, aj = £ for some 
i ^ j] hence ai £ aj for some i j. If e aj where i < j, then G for aZ/ i < j 
(by indiscernibility) and hence ao,ai G a2 H 03, and this forces ao = fli or a2 —0,3, in 
both cases a contradiction. Similarly the assumption that G with i > j leads to a 
contradiction. □ 

Many other results in the combinatorial literature lead to non-trivial (upper and 
lower) bounds on vc(^) if (f encodes the incidence of points on various geometric objects; 
see [Oni Chapter 4] or [71]. For example, let R = (M, 0, 1, +, — , x , <) be the ordered 
field of real numbers. Let 

<f{xi,X2;yi,y2) ■= {xi - yif + {x2 - y2f = 1, 

so is the collection of circles with radius 1 in the plane, and E{G^) is the set 
of incidences between points in and circles of radius 1. Then an analogue of the 
Szemeredi- Trotter Theorem [95] (or a more general result due to Pach and Sharir [73] 
on families of simple plane curves) and (j4.ip yields vc^{(p) < |. (However, it is unknown 
whether this bound is sharp, cf. [TH Section 2].) 

4.3. Irrational VC density. In [8 it is shown that for every real number r > 1 there 
exists a set system S C (p^-|) with vc(5) = r. Wc do not know the answer to the 
following question (though we suspect the answer to be negative): 

Question. Is the VC density of a formula in a NIP theory always rational? 

Let £gr = {E} be the language with a single binary relation symbol E. The Cgi- 
structures are nothing but the (directed) graphs (with E interpreted as the edge rela- 
tion). Given a graph G we denote by V{G) its set of vertices and by E{G) its set of 
edges. Spencer and Shelah fOT established a 0-1-law for £gi.-sentences about random 
(symmetric, loopless) graphs with n vertices and edge probability n~", where a is an 
irrational number between and 1. We denote the resulting complete £gr-theory by T^. 
It was shown by Baldwin and Shelah [9 that Ta is stable. (This can also be checked 
by simply verifying that Ta is superflat in the sense of [77|; cf. Section |4^ below.) In 
particular, Ta is NIP, so it makes sense to investigate VC density of formulas in Ta- We 
consider the following >Cgr-formula: 

(p{x;y) = E{x,y) V x = y. 

For any graph G and vertex v of G, the formula (p{x; v) defines the (closed) neighborhood 
of u, i.e., the set consisting of v together with all vertices adjacent to it. It is tempting 
to guess that vc{(p) = 1/a. (This would give rise to a negative answer of the question 
posed above.) However, it turns out that vc{(p) is an integer: 

Lemma 4.9. yc{ip) — [1/aJ. 

Before we give the proof, we recall some basic facts about the theory Ta', our main 
reference is [9j . We let G be a model of Ta . 

A rooted graph is a pair (i?, H) where H is a finite graph and R a proper subset of its 
set of vertices; the elements of R will be called roots. We consider each finite non-empty 
graph as a rooted graph by identifying it with (0, iJ). Given a rooted graph {R, H), a 
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S^{S,S\...} 



R=[t] 



Figure 4.1. The rooted graph associated to a set system 



rooted graph (_R, H'), where H' is subgraph of H whose vertex set properly contains R, 
is caUed a rooted subgraph of {R, H) . 

A weak embedding of a rooted graph (R,H) into G is an injective map l: V{H) — > 
V{G) such that for all roots v and non-roots w of {R, H), v and w are adjacent in H iff 
l{v) and l(w) are adjacent in G; such a weak embedding is called an embedding if also 
any two non-roots v and w of {R, H) are adjacent in H iff i(t;) and l(w) are adjacent 
in G. Note that there is no requirement about edges between roots. (This terminology 
does not appear in |94] which talks about "(i?, iJ)-extensions" instead.) 

Let {R,H) be a rooted graph. The average degree of {R,H) is a.deg{R,H) := 2e/v 
where v = v{R,H) > is the number of vertices of H which are not roots and e = 
e(i?, H) is the number of edges of H which do not have both ends in R. The maximum 
average degree mdeg(i?, i?) of {R, H) is defined as the maximum of adeg(_R, iJ') where 
{R, H') is a rooted subgraph of (i?, H). If adeg(i?, H) > 2/a then (i?, H) is called dense, 
and sparse otherwise (i.e., if a.deg{R, H) < 2/a). If mdeg(_R, iJ) < 2/a then {R,H) is 
called safe, and unsafe otherwise. 

Now if H is dense then G does not contain a copy of iJ, whereas if H is safe then G 
contains a copy (indeed, an induced copy) of H. More generally, if {R, H) is unsafe then 
there is no weak embedding of (R, H) into G [94l P- 69], and if {R, H) is safe then every 
injective map R — > V{G) extends to an embedding of (i?, H) into G [Ml Theorem 5.2.1]. 

Let now 5 be a non-empty set system on[t\ — {\, . . . ,t} , where i > 0. We associate a 
rooted graph [R, H) — {Rs, Hg) to S as follows: the set of vertices of H is the disjoint 
union of [t] and S, the set of roots is i? = [t], there are no edges between two roots and 
no edges between two non-roots, and a root i € [t] and a non-root S € S are related 
by an edge iff i e S*. (Cf. Figure |4?T] ) Note that this rooted graph has average degree 
adeg(i?, H) — j|j X^sesl*^! ^^'^ maximum average degree 

2 

mdeg(i?, iJ)= max t-t-t > IS'l. 

So if 5 C (W) where k e {0,...,t} then adeg(i?, H) = mdeg(i?, H) = 2k; hence if in 
addition k < 1/a then {R,H) is safe (so there exists an embedding of {R,H) into G) 
whereas ii k > 1/a then {R,H) is dense (and so there is no weak embedding of {R,H) 
into G). 

Proof of Lemma \4-9\ Applying the remarks above to 5 = (^j^/j^.j), where t > 0, we see 
that, as [1/aJ < 1/a, there are pairwise distinct vertices oi, . . . ,at and bs (S G S) of 
G such that and &s are adjacent iff i e 5", i.e., writing ^ = {oi, . . . ,at} we have 
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A n ip{G;bs) = {ai : i e S} and hence \A n > \S\. So 7:^(1) > for each t, 

therefore vc{(p) > [1/aJ- 

To show the reverse inequahty suppose for a contradiction that vc{ip) > [l/aj . Let 
£1 be a real number with vc{(p) > g > [l/aj . Note that for every set A of vertices of G 
the set system ACiS^p is the union of the set system 

{{b}U{ae A: {a,b) e E{G)} -.be A} (4.2) 

consisting of at most \A\ sets, and 

{{aeA: (a, b) G E{G)} : b G V{G) \ A}. (4.3) 

Since vc{ip) > g, for every C > 1 there are arbitrarily large t > and A C V{G) with 
\A\ = i such that |A n iSi^| > 2Gt^. The set system (|4.2p has at most t elements; hence 
(|4.3p contains at least 2Gt^ — t sets and so, since g > [1/aJ > 1, contains at least Ct^ 
sets. Identifying A with [t], the set system (|4.3p on A thus gives rise to a set system S 
on [t] with \S\ > Ct^ whose associated rooted graph {Rs,Hs) weakly embeds into G. 

On the other hand, let S be any set system on [t] such that (i?5, Hg) weakly embeds 
into G, and for fc = 0, . . . , t consider the set system Sk :— S ('^J) on [t]. If 5fe 7^ then 
{RSk,Hs^) is a rooted subgraph of {Rs,Hs) and hence 

2k = adeg(i?5,,i/5j < mdeg(i?5,i?5) < 2/a. 

Therefore 5 does not contain a fc-element subset of [t] with k > l/a, i.e., S C 

Hence \S\ < CtL^/"J where C = is a constant only depending on a. This contradicts 
the previous paragraph. □ 

We remark that a similar analysis shows that the simpler formula E{x, y) also has 
VC density [1 / aj in Ta ■ We chose ip as above because it allows us to compare Lemma HT^l 
with the main result of [4], where the precise value of the VC dimension of (as it 
depends on a) is computed. In particular, [4j Corollary 8] shows that if < a < ^ then 

[1/aJ + 3 < VC(ip) < [1/a + 3{a + 1)J . 

4.4. Shatter functions not growing like a power. We finish this section with two 
examples of VC classes definable in NIP theories whose shatter function is not asymp- 
totic to a real power function. 

4.4.1. The hypercube. Let Q be the "infinitary hypercube", i.e., the (symmetric, loop- 
less) graph whose vertex set is the set of all sequences s = (s„) in {0, 1}^ with finite 
support, with two sequences related by an edge iff they differ in only one component. 
(Alternatively, Q can be represented as the set of all finite sets of natural numbers, with 
an edge between them iff their symmetric difference is a singleton.) Note that Q is the 
increasing union Q — [J^^q Qd of its induced subgraphs Qd with vertex set 

V{Qd) = {s = (s„) G Q : s„ = for n > d}, 

which we may identify with {0, 1}'' in the natural way. (So Qd is the d-dimensional 
hypercube.) We construe Q as an £gr-structure. In [77] a condition sufficient for a 
(symmetric) graph to be stable is introduced, called superfiatness: a graph G is superflat 
if for every m there is some n such that no subdivision of the complete graph Kn on n 
vertices, obtained by placing at most m additional vertices on each edge, embeds into G. 
Note that the graph Q is not superflat: in fact, for every d there is an embedding of a 
subdivision of Kd+i, obtained by placing at most one additional vertex on each edge, 
into Qd, cf. [37]. However, we do have: 
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Proposition 4.10. Q is oj-stable. 

Towards a proof of tliis proposition, we first introduce some notation and terminology: 
Given a language £ let C{P) = £ U {P} where P is a new predicate symbol, and given 
an £-structure M and a subset A of its domain let (Af, A) be the expansion of M to 
an £(P)-structure obtained by interpreting P by A. The induced structure Ai^d on A 
is the structure whose language consists of an m-ary relation symbol for every C- 
formula ip{x), where m = \x I, interpreted in A-.nd by (p^ {M"^) n A". An £(P)-formula 
ip{x), where x = (xi, . . . , Xm), is said to be bounded if it has the following form (slightly 
abusing syntax): 

(p{x) = Oi2/i e P - • • OnVn e P^ixi, . . .,Xm,yi, ■ • • ,?/«), 
where Oi G {V, 3} and '0 is an £-formula. Casanovas and Ziegier have shown: 

Theorem 4.11. Let M be a structure in a language C and let A C_ M . 

(1) Suppose M is strongly minimal. Then in {M,A), every C{P)-formula is equiv- 
alent to a bounded formula. 

(2) Suppose that in {M,A), every C{P)- formula is equivalent to a bounded formula, 
and let A > |£| be a cardinal. If both M and A^d cife X-stable then {M,A) is 
X-stable. 

(See [18l Corollary 5.4 and Proposition 3.1]; part (1) had actually first been shown 
by Pillay [76].) 

Consider now, slightly more general than necessary, an arbitrary field K, and let 
= {0, +, (A- )a6k} be the language of K-vector spaces. Let M be an infinite- 
dimensional X-vector space. Then M, construed as an £i^-structure, has quantifier 
elimination and is cj-stable. Let A be a set of linearly independent elements of M. 
Then the induced structure on A is trivial: every subset of A™ definable in Ai^d is 
definable in the empty language. Hence by the theorem above, the £i<-(P)-structure 
(M, A) is w-stable. 

For the proof of Proposition 14.101 it now suffices to note that the £gr-structure Q 
is definable in {M,A), for suitable choice of K, M and A: Take K = ¥2 and let 
M — 0^ F2 a„ be a countably infinite F2-vcctor space with distinguished basis A = 
{a„ : n > 0}. Then Q is definable in {M,A): identifying V{Q) with M in the natural 
way, we have, for all vertices s, t of Q: (s,t) e E{Q) iS s — t £ A. Since {M,A) is 
w-stable, so is Q. □ 

Now consider the £gr-formula ip{x; y) :— E{x, y). Then 

= {{s, s'} ■.s,s'e g, (s, s') e EiQ)} 

is the collection of undirected edges of Q. In the following A denotes a subset of Q 
(unlike in the proof of Proposition I4.10p . Note that if A C then 

AnSiji ^ el) U E[A] where E[A] -.^ {{a,a'} : a,a' £ A, {a,a') £ E{Q)} . 

Since Qd has 2'' vertices and ^d2'^ undirected edges, we thus see that 7r^(i) > t+^t\ogt 
for infinitely many t; in fact: 

Proposition 4.12. 7r^(t) = ^tlogi (1 + o(l)) as t—> 00. 
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Proof. Set 

Edit) := niax{|i;[A]| : ACQ^, \A\ ^ t} for d > and t < 2'*. 

Then 7r^(i) — t + maxd>\iogt] Ed{t). It is known (see, e.g., [T]) that there is some 
function g with g{t) = ^tlogt (1 + o(l)) as t — ^ cx) such that = 9{t) for ah d and 

t<2'^. This yields the claim. □ 

4.4.2. An example in R = (K, 0, 1, +, — , x, <). For this we use another one of the rare 
examples (besides the Szemeredi- Trotter Theorem) where tight bounds on the number 
of incidences are known: 

Theorem 4.13 (Pach and Sharir [Zl])- Let a be a real number with < a < tt. The 
maximum number of times that a occurs as an angle among the ordered triples of t 
points in the plane is 0{t^\ogt). Furthermore, suppose tan(a) e (Q\/d where d e N 
is not a square. Then there exists a constant C = Ca > and, for every t > 3, a 
t-element set St C with the property that at least Ct^logt ordered triples of points 
from St determine the angle a. 

Let X = {xi,X2), y — (2/1,2/2), z — (zi,Z2) and consider the formula 

1^9(2;, y,z) :— X ^ y /\ X ^ z /\ 2{y — X , z — x) — ||j/ — x||||z — a;|| 

in the language of the ordered field of real numbers R, where ( , ) denotes the usual inner 
product on and 1 1 1 1 the associated norm. Then for a, 6, c G we have R \= (p{a, b, c) 
iff the vectors b—a and c—a are non-zero and the angle Z(6, a, c) between them is ^. Let 
(p{v; X, y, z) be the partitioned formula with object variables v — (wi, V2) and parameter 
variables {x, y, z) given by 

(p{v; x, y, z) :— ip{x, y, z) A [v = x V v = y V v = z), 
so Sip consists of all {a, b, c} £ ) with R \= (p{a, 6, c). We now have: 
Corollary 4.14. There exist constants Ci,C2 > such that 

Cit'^ \ogt < -Kipit) < C2t^ \ogt for every t > 0. 
That is, np{t) — 9(t^ logt) as t 00. 
Proof Let ^ C flnj^e. Then 

^ n 5^ = (^2) U |{a, b, c} e (3) : h V'ia, b, c)} . 

Since tan(7r/3) — \/3, the second part of Theorem 14.131 applies to a = The upper 
bound in CoroUarv 14.141 now follows from the first assertion in Theorem 14.131 and the 
lower bound from the second assertion in the same theorem. □ 

5. Theories with the VCd Property 

After defining the VC d property we prove that if a theory has this property then the 
dual VC density of any finite set of partitioned formulas in the tuple of object variables 
X is at most d\x\. In Section [6]we will then show that various theories have the VCd 
property. In the following M is a structure in a language £, A(x; y) is a finite non-empty 
set of partitioned £-formulas, and m = \x\. 
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5.1. Uniform definability of types over finite sets. Given a A(a;; i?)-type q € 
S^{B), where B C M^^^ a family J" = {(p^)ipeA of £(A/)-formulas (p#{y) is said to 
define q if for all ip d A and b d B we have 

^{x;b)eq ^ M^ip#{b). 

We also say that T is a definition of q. For a family T = J^{y;v) of partitioned 
£-formulas -0(2/; w), we denote by F{y;c) the family (-(/;(?/; c))^gjr of >C(A/)-formulas 
obtained by substituting a given tuple c € M'"' for the tuple of variables v. The 
following generalizes a definition due to Guingona |34) : 

Definition 5.1. We say that A has uniform definability of types over finite sets (ab- 
breviated as UDTFS) in M if there are finitely many families 

^■1 = {v^{y\yl.■■■,yd))^^^ 

of ^-formulas (where \yj\ = \y\ for every j — 1, . . . , d) such that for every finite set B C 
MI^I and g e S'^{B) there are bi, . . . ,bd & B and some i e / such that 6i, . . . , 6^) 
defines q. We call the family = a uniform definition of A{x; B) -types over 

finite sets in 7W with d parameters. If A = {ip} is a singleton, we also speak of ip having 
UDTFS. 

The following observation shows in particular that every finite set A(x; y) of parti- 
tioned £- formulas which is directed (see Example I3.12p has UDTFS in T with a single 
parameter: 

Lemma 5.2. Let A{x;y) be a finite set of partitioned C-formulas, and suppose the 
set system Sa — \}p{M^^^\b^ : b e Ai^'^'} has breadth d. Then A has UDTFS with d 
parameters. 

Proof. To see this set J-o{y) := (^^(z ^ z))^p^A and, for each d-tuple = ii'i, ■ ■ ■ , '4'd) G 
A'', define 

J^i^{y;yi,---,yd) {(p^{y;yi,...,yd))v&A 

where 

Vi,{y;yi, ■■■,yd) := Va;(V'i(a;;yi) A • • • A '4)d{x]yd) <f{x;y)). 
Then J- = {J^^)^,£{a}uA'' is a uniform definition of A{x; _B)-types over finite sets. For 
suppose q G S'^{B) where B C A/I^l is finite. If ip{x; b) q for all if e A, b e B, then 
To{y) defines q. Otherwise, by assumption we can pick Tpi{x;bi), . . . , ipd{x] bd) G q such 
that 

fl (^(Afl-I;6) = Vi(A/l"l;&i)n---nVd(Ml-l;6d). 

Then F^{y\ bi, . . . , bd), where ip = (V'l, . . . , ipd), defines q. □ 

A uniform definition of A(a;; i?)-types over finite sets in M remains a uniform defi- 
nition of A{x] -B)-types over finite sets in any elementarily equivalent structure, and so 
it makes sense to speak of uniform definability of types over finite sets in a complete 
theory. If we do not care about the number of parameters, we can always do with a 
single defining scheme (at least for non-trivial parameter sets): 

Lemma 5.3. Let J- — be a uniform definition of A{x; B)-types over finite sets 

with d parameters as above, and let n = \L\. Then there exists a family 

J"# = (v3#(y;?/i,...,?/d,w,w;i,...,m„)) 
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such that for every finite set B C A/'^' with > 2 and every q £ S^{B) there are 
61, . . . , &d, c, ci, . . . , c„ € B such that T^{u] 61, . . . , 6^, c, ci, . . . , c„) defines q. 

Proof. This is a simple coding trick due to Shelah (proof of Theorem II. 2. 12 (1) in [88], 
cf. also [Ml Lemma 2.5]). For every ip G A define 

n 

f#{yiyi,-- ■,yd,v,wi,...,Wn) -.^ /\(v ^w^-> ipt{y;yi, ■ ■ ■,yd)) 

4=1 

and let = (</5#)veA. Let B C mI^'I, \B\ > 2, and q G 5*^(5). By hypothesis, 
there are 61, . . . , 6d £ B and i G I such that Ti{y;bi, . . . , bd) defines q. Pick c,c' £ B 
with c 7^ c', and put Ci := c, Cj c' for j 7^ i. Then T^{y; bi, . . . ,bd,c,ci, . . . , c„) 
defines q. □ 

Similarly, the proof of Lemma 13. 181 (1) shows that if the £- formula V'A which we as- 
sociated there to the finite set of £-formulas A admits a uniform definition of ^Pa{x] B')- 
types over finite sets with d parameters, then A itself admits a uniform definition of 
A(x; i?)-types over finite parameter sets B (with at least 2 elements) having d + 2 
parameters. 

On the other hand, if we have tight control over the number d of parameters in our 
defining schemes, then we can bound the sizes of the A(a;; i?)-type spaces over finite sets 
by polynomial functions (in the size of the parameter set) of degree d: more precisely, if 
A allows a uniform definition — of A(a:; i?)-types over finite sets in M with 

d parameters, then \S^{B)\ < \I\ \B\'^ for every finite B C M\y\, hence TT*^{t) < \I\t'' 
for each t, and so vc*(A) < d. 

5.2. The YCd property. We say that M has the VCd property if any A(x;y) with 
\x\ — 1 has a uniform definition of A(x; i3)-types over finite sets with d parameters. 
Clearly, if M has the VCd property, then so does every elementarily equivalent C- 
structure. We say that a theory T has the VC d property if every model of T has the 
VC d property. 

We point out that the VC property only holds in a very special situation; recall 
that a structure is called rigid if it has no automorphisms besides the identity. 

Lemma 5.4. The following are equivalent: 

(1) M has the VCO property; 

(2) every model ofTh{M) is rigid; 

(3) M is finite and rigid. 

Proof. Suppose M has the VC property; to see (2), it suffices to show that M is rigid. 
Let (p{x;y) be the i2-formula x = y, and let T = {(pi{y))i^i be a uniform definition of 
ip{x; B)-types over finite sets. Suppose cr e Aut(M) and b £ M satisfy b 7^ cr{b). Let 
p — tp*'(6/i?) where B — {b, cr(6)}, and choose i £ I such that ipi defines p. Then 

b^b ^ Mh^,(6) ^ Mh<^4(^(6)) ^ b = a{b), 

a contradiction. This shows (1) (2), and (2) =^ (3) is obvious. Suppose now that M 
is finite, and let A{x;y), where \x\ = 1, be a finite set of partitioned /^-formulas. It is 
easy to see that then for eachp £ 5'^(mI^I) there is a family = {(pp{y) : f £ A} of £- 
formulas such that for all b £ M'^' we have M |= fpib) iff there is some automorphism a 
of M such that (p{x; b) £ <j{p). Hence if in addition M is rigid then F — {Fp)p^s^(M\y\) 
is a uniform definition of A{x\ i3)-types over finite sets in M . This shows (3) ^ (1). □ 
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The following result and its Corollarv l5.6l are useful if we have some kind of quantifier 
elimination result at hand: 

Lemma 5.5. Suppose A = A(x;2/) and 4* = ^'(x;?/) are finite sets of partitioned C- 
formulas such that every formula in A is equivalent in T to a Boolean combination of 
formulas in , and has UDTFS in T with d parameters. Then A has UDTFS in T 
with d parameters. 

Proof. We may assume that each G A has the form 

where R^, Sr.^p are finite index sets, Dr.s.ip is ^ or no condition, and ipr,s.^p G ^- Suppose 
G = {Qi)i£i is a uniform definition of '^[x] -B)-types over finite sets in T, where 

Qi = {ipiiyiVi, ■ ■ -^yd))^^^ for each i E I. 

Let J- — {J-i)i(zj where 

-^i = (V'ily;?/!, ■ • ■ ,yrf))^gA where = /\ \/ Dr^s.^^ {tpr.^s,^)i- 

Let q e S^{B) where B C M'^'I is finite. Let a e A/I^l realize q, and put p := tp*(a/B). 
Take i E I and bi, . . . ,bd & B such that Qi{y]bi, . . . ,bd) defines p. One now easily verifies 
that Ti{y;bi, . . . , bd) defines q. □ 

Corollary 5.6. Suppose <^ is a family of partitioned C-formulas in the single object 
variable x such that 

(1) every partitioned C-formula in the object variable x is equivalent in T to a 
Boolean combination of formulas from <&, and 

(2) every finite set of C-formulas from <i> has UDTFS in T with d parameters. 

Then T has the VC d property. 

The following theorem is at the root of the proof of Theorem 11.11 from the introduc- 
tion; it shows that having UDTFS with a constant number of parameters for all sets of 
formulas in a single object variable entails UDTFS with a linearly bounded number of 
parameters for sets of formulas in an arbitrary number of object variables: 

Theorem 5.7. Suppose that M has the YCd property. Then every l^{x\y) has a 
uniform definition of A{x; B) -types over finite sets in M with d\x\ parameters. 

Before we embark on the proof, we introduce some notation: for a sequence a € M™ 
and a set B C we write aB := {(a, b) : b G B} C M'"+" and Ba := {(&, a) : b € 
B} C M"+™. 

Proof. We proceed by induction on m = The base case m — 1 holds by hypothesis. 
For the inductive step write x — {xo,x') where x' = (xi, . . . ,Xm), and let A{x;y) be 
given. Let 

Ao{xo;x',y) = {(p{xo;x' ,y) : (p{x;y) e A}. 
By the YCd property applied to Aq, we take finitely many families 

J'i^ {ipi{x',y;yi,...,yd))^^^ {i e I) 
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of £-forniulas with the foUowing property: for any a' £ Af™, any finite set B C Afl*'' 
and any q £ S^°{a'B), there are bi, . . . ,bd & B and i £ I such that J^i{a' , y;bi, . . . , bj) 
defines g, i.e., for ah 1^9 e A, & € B: 

ip{xo;a',b) £ q ■^=^ M \= ipi{a' ,b;bi, . . . ,bd). (5.1) 

In the rest of this proof let (p range over A and i over /. For each i, let 

^i{x';y,yi, ...,yd) = {fi{x';y,yi, ...,yd)- f{x;y) £ A} 

and apply the inductive hypothesis to each A^. Thus for each i there are finite families 

•^u — {vtj{y,yi, . . . • • ■ >«n))^g^ (j e J^) 

of £-formulas, where n = md, such that for all finite subsets B C M'^', all b = 
{bi, . . . ,bd) £ (mI^I)'', and every p £ S'^' (Bb), there exists some j £ Ji and ci, . . . , c„ £ 
B such that for each (p and each b £ B we have 

(pi{x';b,b) £ p ■i=> M \^ (pij{b,b;ci,. . . ,Cn). (5.2) 

Partition the variable tuple of the £-formulas (pij as {y;yi, . . . , yd, wi, . . . , Vn), and set 
:= {J-ij)i^ij^j.. We claim that is a uniform definition of A.{x; i?)-types over finite 
sets in M; since JF has d + n = d{m + 1) parameters, this will then finish the inductive 
step. To see this, let a finite B C Af'^' and some a = (ao,a') £ A/^+™ be given. We 
let b range over B. We need to show that there are some i £ I, j £ Ji and b £ B'^, 
ci, . . . ,Cn € B such that for all ip and b we have 

M\=ip{a;b) ^ M 1= (p,j(fe;S,ci,...,c„). (5.3) 

Let q = tp'^° {ao / a' B) be the type in S^"{a'B) realized by oq. Take i and b = 
(61, ...,bd) £ B'^ such that holds for aU (p and all b. Set p = tp'^^ {a' / Bb) . Then 
we may take j £ Ji and ci, . . . ,Cn € B such that for each ip and each 6, the equivalence 
dO]) holds. This yields dSSJ, for all and b. □ 

In the next corollary we assume that M is infinite (so we can meaningfully talk about 
VC density). We already remarked that if A admits a uniform definition of A{x;B)- 
types with d parameters, then vc*(A) < d; in particular, if M has the YCd property 
then this conclusion holds for all A(x; y) with \x\ — 1. The previous theorem generalizes 
this observation to the upper bound vc*(A) < d\x\ for all A. Hence: 

Corollary 5.8. If T = Th(7Vf) has the YCd property then m < vc^{m) < d ■ m for 
every m. In particular, if T has the VC 1 property then vc^{m) — m for every m. 

In a multi-sorted setting, the VC d property can be naturally localized. Suppose M 
is a multi-sorted structure, and let S be one of the sorts of M . We say that M has the 
VC d property in the sort S if any A(a;; y) with x a single variable of sort S has a uniform 
definition of A(a;; i?)-types over finite sets with d parameters. With this definition, the 
following analogue of Theorem 15.71 holds (with the same proof) : 

Corollary 5.9. If M has the VC d property in the sort S , then every A(a;; y) with each 
variable Xi of sort S has a uniform definition of A{x; B)-types over finite sets with d\x\ 
parameters. 

So for example, if M is an infinite single-sorted structure and some expansion of 
M'^'^ has the VC d property in the home sort (where the uniform defining formulae are 
in the expanded language), then vc"^(m) < dm for each rn. 
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5.3. Relationship to other notions. We now want to put the YGd property into 
perspective and compare it to two otlier strengthenings of the NIP concept, namely, 
uniform definability of types over finite sets and dp-minimality. The following notion 
was introduced and studied in [34 : 

Definition 5.10. The structure M is said to have uniform definability of types over 
finite sets ( UDTFS) if every partitioned ^-formula has UDTFS in M. Clearly UDTFS 
is an invariant of the elementary theory of M , and so we say that an >C-theory T has 
uniform definability of types over finite sets {UDTFS) if every model of T does. 

By Theorem 15. 7[ if M has the VCd property, for some d, then M has UDTFS. 
More generally, |34i Lemma 2.6] shows that if every partitioned £-formula (p{x;y) with 
\x\ = 1 has UDTFS in M, then M has UDTFS. The proof of this lemma as given 
in [34] in fact shows that if every partitioned £-formula with a single object variable 
admits a uniform definition of ip{x] _B)-types with d parameters, then every partitioned 
iZ-formula in the object variables x admits a uniform definition of f{x; -B)-types with at 
most (d+ 1)1^1 — 1 parameters. (In contrast, our bound in Theorem 15. 71 is linear in \x\.) 

Every stable formula has UDTFS; in particular, every stable theory has UDTFS. In 
fact, Laskowski ^56j has shown that if T is stable then every partitioned £-formula (p{x; y) 
has UDTFS in T with R"^{x = x,ip,2) parameters. (See [H Definition II.l.l] for the 
definition of the rank i?™(— , — , 2). Stability of T is equivalent to R™{x — x,(p,2) < uj 
for all (pix;y), cf. [88l Theorem II.2.2].) 

An ICT pattern in M consists of a pair a{x;y), f5{x;y) of partitioned >C-formulas, 
where \x\ = 1, and sequences (ai)igN, {bj)jeN in -M'^' such that for all i and j the set of 
£(Af )-formulas 

{a{x;ai),l3{x;bj)} U {^a{x;ak) : k ^ i} U {^l3{x;bi) : I ^ j] 

is consistent (with M). This notion and the following definition originate in [90] : 

Definition 5.11. An £-thcory T is said to be dp-minimal if in no model of T there is 
an ICT pattern, and M is dp-mininmal if Th(iW) is dp-minimal (equivalently, if there 
is no ICT pattern in an elementary extension of M). 

The following proposition (which shows that in the definition of dp-minimality we 
could have restricted ourselves to ICT patterns given by identical formulas a, j3) and 
its Corollary 15.131 are due to Dolich, Goodrick and Lippel [23] ; for convenience of the 
reader we indicate their proofs: 

Proposition 5.12. Suppose M is a monster model of the complete C-theory T . Then 
T is dp-minimal iff there are no C-formula (p{x;y) with \x\ = 1 and sequences (ci)igN, 
(c?j)jgN in M'^I such that for all i, j , 

{(p(x;Ci),Lp(x;dj)} U {^(p{x;ck) : k ^ i} U {^Lp{x;di) : I ^ j] 

is consistent. 

Proof. Suppose a{x; y), j3{x] y) (where |a;| = 1) and the sequences (ai)igN, {bj)j&i are an 
ICT pattern in Af. Let ip{x]y,z) := a(a;; j/)V/3(a;; z), and for every z, j let Ci := (021,^21) 
and dj :— (a2j+i, b2j+i). Let a Cz M realize the type 

{a{x;a2^),^3{x;b2J+l)} U {^a{x;ak) : k ^ 2i} U {^P{x;bi) : I ^ 2j + l}. 

Then a satisfies 'fi{x; ci) and (p{x; dj). li k ^ i then a satisfies ^a{x; a2k) (since 2k ^ 2i) 
and -'fi{x;b2k) (since 2k ^ 2j -\- 1) and hence also ^Lp{x;ck). Similarly we see that a 
satisfies -'ipix; di) for I ^ j. □ 
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Let us tentatively say that a complete £-tlieory T is vc-minimal if vc*{ip) < 2 for 
every £-formula (p{x;y) with = 1. So if vc-^(l) < 2 (in particular, if T is VC- 
minimal), then T is vc-minimal. An example of a theory T which is not VC-minimal 
yet satisfies vc-^(l) = 1 (and thus is vc-minimal) was given in [24l Proposition 3.7]. 

Corollary 5.13. Every vc-minimal theory is dp-minimal. 

Proof. Suppose M is a monster model of T = Th(iVf), and T is not dp-minimal. 
Take ip and (c^), (dj) with the properties in the previous proposition. For every n let 
Bn ■■= {ci:i<n}U {dj : j < n}. Then |S"^(S„)| > > i|B„|2, hence vc*{ip) > 2. □ 

In particular, each of the theories in Examples I3.8H3.10] (being VC-minimal) is dp- 
minimal. Other proofs of the dp-minimality of weakly o-minimal theories can be found 
in m [25 . See also [IB] for a generalization of Corollary 15.131 to a bound on "dp-rank" 
in terms of VC density. 

A characterization of dp- minimal theories among stable theories was given in |71| . 
The main result of |34) is that every dp-minimal theory T has UDTFS. In particular, 
by Corollarv l5.131 every vc-minimal T has UDTFS. (Actually, [34, Theorem 3.14] gives 
a more precise result: if (p{x; y) is an £-formula such that 'Ktp{t) < (*^^) for some i > 0, 
then ip has UDTFS.) 

We summarize the implications between the properties of a theory T discussed above 
in the following diagram: 



VC-minimal 

Here the arrows marked with an exclamation mark are known not to be reversible. 
(For an example showing that vc(l) = 1 7^ VC 1 see [5J Example 3.15].) We do not 
know which of the other arrows are reversible; whether the converse of the implication 
UDTFS ^ NIP holds was first asked by Laskowski [34l Open Question 4.1]. 

Recall from Corollary 13.201 that the Shelah expansion M^^ of M has the same 
VC density function as M . In [71] it is observed that the Shelah expansion of a dp- 
minimal structure is again dp-minimal. We finish this section by showing that the 
analogous statement also holds for the VCd property; in fact, we have more generally: 

Proposition 5.14. Suppose every finite set of partitioned C-formulas in m object vari- 
ables has UDTFS in T with d parameters. Then every finite set of partitioned C^^- 
formulas in m object variables has UDTFS in T^^ with d parameters. 

Proof. As in the definition of the Shelah expansion (cf. Section 13. 5|) let M* be a very 
saturated elementary extension of M. Let A = A{x; y) be a finite set of partitioned C^^- 
formulas where m = \x\; below (p ranges over A. We need to show that A has UDTFS 
in T^^ with d parameters. For this, by Lemma 15.51 and since T^^ admits quantifier 
elimination, we may assume that each of the £^^-formulas in A is atomic; that is, there 
exist >C-formulas il^,p{x, y; z), one for each tp, and a tuple c G (Af*)'^' such that each tp 



VCl 



^ VC d for some d > 




VC 




^ vc-minimal 



^ dp-minimal =^ UDTFS => NIP 
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has the form ip{x; y) = R^^^dx; y). Let now y, z) {ipip{x; y,z) : ip £ A} and take 
a uniform definition Q = (G)iei of '^'{x; _B*)-types over finite sets in T^^, where 

= {{ip^)t{y,z; (yi,zi),. . . , {yd,Zd)))^^^ for each i e /. 

For each i e / set 

:= (i?^.,c(y;2/i,...,yd))^g^ 

where (p^{y, yi, . . . , yd] z) := {ip^)t{y, z, (yi, z),..., {yd, z)). 

We claim that J- = {J-i)i£i is a uniform definition of A(a;; i?)-types over finite sets in 
T^h_ To see this let p e S'^{B) where B C mI^^I is finite, and let a e A/™ be a realization 
of p in M^h. Put B* := B X {c} C Afl^'l x (M*)l^l and let p* := tp*(a/B*) (in M*). 
Take bi, . . . ,bd S B and i € / such that 5i(y, z; (&i, c), . . . , c)) defines p*; then for 
every (p and 6 e _B we have 

^ M* h (V'v')^(^c;(6i,c),...,(6d,c)) 
That is, T,{y;bi,..., hd) defines p (in M'^'^). □ 



6. Examples of VCd: Weakly O-minimal Theories and Variants 

In this section we apply Theorem 15.71 from the preceding section to give a proof of 
Theorem 11.11 on VC density in weakly o-minimal theories from the introduction. We 
also observe that a similar technique allows us to treat all (weakly) quasi-o-minimal 
theories. 

Throughout this section iZ is a language containing a binary relation symbol "<" and 
T is a theory extending the theory of infinite linear orderings. 

6.1. Weakly o-minimal theories. We begin by introducing some terminology con- 
cerning ordered sets. Let (X, <) be a linearly ordered set, and let 5 be a subset of 
X which is a union of finitely many non-empty convex subsets of X. We refer to the 
convex sets in the unique minimal such presentation of S as its [convex) components. 
Suppose S has N convex components, where iV > 0. These components are ordered 
by <, so for i — 1, . . . , N we can refer to the ith component of S] for i > N we declare 
the ith component of S to be equal to the iVth. 

Recall that T is called weakly o-minimal if for any M \= T, any definable subset of 
M is a finite union of convex subsets of M. 

Theorem 6.1. Assume that T is weakly o-minimal. Then T has the VC 1 property, 
and hence any finite set A{x;y) of C-formulas has dual VC density at most \x\. 

Proof. Let M |= T. Fix a finite non-empty set of ^-formulas A{x;y) with |a;| = 1. We 
let ip range over A and b over M'^L By the weak o-minimality of T, there is an integer 
> such that for any ip and any b, p{M ; b) has at most N components. For any p 
and i G [N] there is an £-formula p'^{x; y) such that for every b with p>{M] b) ^ 0, the 
ith component of p{M ; b) equals p^{M\ b), and such that for every b with p{M; 6) = 
we have ip\M;b) = 0. So 

p{M;b) ^ p\M;b)U---Up'^{M;b) for every 6. (6.1) 
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Set 

f<{x;y) ■■= 3xo{(p\x();y) Ax < xq), 
f<{x;y) ■■^yxo{ip\xo;y) ^ x < xq). 

Then clearly 

ip'{M; b) = ip<{M; b) n (M \ Lp'^{M; b)) for all (p, & and i G [N]. (6.2) 

Now set 

^{x- y) := {^^(x; y) : ^ € A, ie[N], □ G {<, <}}. 

For each -0 S and each &, the set ip{M;b) is an initial segment of M; hence 5* 
is directed, so 4* has UDTFS with a single parameter, by Lemma 15.21 Moreover, by 
(|6.ip and (|6.2p . every (/3 is equivalent to a Boolean combination of 2N formulas from ^. 
Hence A also has UDTFS with a single parameter, by Lemma 15.51 Thus M has the 
VC 1 property. By Corollary 15.81 therefore vc*(A) < |a;| for every finite set A{x;y) of 
iZ-formulas. □ 

Remark. In the previous theorem we assume that the theory T is weakly o-minimal 
(i.e., all models of T are weakly o-minimal). Recall that a weakly o-minimal structure 
need not have weakly o-minimal theory. We do not know whether the conclusion of 
Theorem 16.11 holds if T is merely assumed to have some weakly o-minimal model. 

Put £div,< Cdiv U {<}, where £div — {0, 1,+, — , x, | } is the language of rings 
expanded by a divisibility predicate (see Example 13.101 above) and "<" is a binary 
relation symbol. Let RCVF denote the theory of real closed fields equipped with a 
proper convex valuation ring, parsed in the language i2div,<- The following corollary is 
now immediate, as by [23], RCVF is weakly o-minimal. 

Corollary 6.2. Let K \= RCVF. Then any finite set A(x;?/) of C(iiv,<-formulas has 
dual VC density at most \x\ in K . 

This result in turn yields a VC density bound for algebraically closed valued fields of 
residue characteristic (which is non-optimal by Example I3.10|) : 

Corollary 6.3. Let ACVF(o.o) be the theory of non-trivially valued algebraically closed 
fields of residue characteristic 0, in the language £div Let \{x\y) be a finite set of 
Cdiv -formulas. Then A has dual VC density at most 2\x\ in ACVF(o,o)- 

Proof. The theory ACVF(o,o)i which is complete, is interpretable in RCVF: if ii' is a 
model of RCVF, then its algebraic closure K^^^ is a degree 2 extension of K: i^T^'^ = 
K{i), where — —1. So i^^'^ can be identified with K^, and the valuation v oi K can 
be definably extended to one of K'^^^ by setting v{a + bi) = ^v{a'^ + b"^) for a,b E K. 
Thus, by Lemma r3.16l and CoroUarv 16.21 A{x; y) has dual VC density at most 2\x\. □ 

We have no results on VC density for ACVF in characteristics other than (0, 0). 

6.2. Quasi-o-minimal theories. We now turn to quasi-o-minimal theories: T is said 
to be quasi-o-minimal if for any M \= T, any definable subset of M is a finite Boolean 
combination of singletons, intervals in M, and 0-definable sets. (See |14j.) 

Theorem 6.4. Assume that T is quasi-o-minimal. Then T has the VC 1 property, and 
hence vc'^{n) = n for each n. 
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Proof. Let M |= T. Fix a finite set A{x;y) of £-formulas with. |a;| = 1; we let cp 
range over A and b over M\y\. There is some positive integer N and 0-definable subsets 
Di, . . . ,Dn oi M so that for any ip and any choice of parameters b, the set (p{M; b) of 
reahzations oi (p{x; b) is a Boolean combination of the and at most N singletons and 
intervals in M [TH Theorem 3]. 

Let T) be the collection of sets of the form Di H ■ ■ ■ D D^, where Di is either Di or 
its complement in M (so I? is a partition of M into at most 2^ sets). We let D range 
over v. For every D, ip, and b, the set D n (p{M; b) is then a finite union of at most N 
convex subsets of the ordered set D. For every i e [N] and every D let ^''^ix;u) be 
an £-formula such that for every b, if the set D n (p{M; b) is non-empty, then the ith 
convex component of D n f{M\ b) (viewed as a subset of the ordered set D) is given by 
ip'^^iM; b), and if L> n (p{M;b) = then ip''^{M;b) = 0. Hence for each (p and b we 
have 

p>{M;b)= U ^^-^{M-b). 

Dev, ielN] 

Now let (slightly abusing syntax) 

ip''^{x;y) -.^ X e D A 3xo(xq e D A ip>''{xQ;y) Ax < xq), 
ip''^ [x] y) -.^ X E D A yxo{xQ E D A ip^ixo; y) x < xq). 

Then 

p''°{M; b) = f'fiM; b)n{M\ if"f{M- b)) for all ^p, b, D and i S [N]. 

Each set ip^^ {x; b), where □ € {<, <}, is an initial segment of D, and any two distinct 
elements of V are disjoint. Thus the set system S^jj, where 

^{x;y) = {plfix;y):ipeA, i e [N], □€{<,<}, D e V} , 

is directed. As in the proof of Theorem 16.11 it now follows that A has UDTFS with a 
single parameter. □ 

Corollary 6.5. The following structures all have the VC 1 property: 

(1) (M, <,Q) (i.e., the ordered set of reals with a predicate for the rationals); 

(2) (Z",<,+) where < is the lexicographic ordering on Z"; 

(3) (Z" X Q, <, +) where < is the lexicographic ordering on Z" x Q. 

Proof. Each of the examples has quasi-o- minimal theory: For (1) this was noted in [HI 
Section 1], and for (2) and (3) this is proved (based on a quantifier-elimination result 
from [9^) in [151 Theorem 15]. The corollary now follows from Theorem 16.41 □ 

Remark. The ordered abelian groups in (2) and (3) of the previous corollary are typical 
for quasi-o- minimal groups. Here and below, "quasi-o-minimal group" means "quasi- 
o-minimal expansion of an ordered group." (A quasi-o-minimal group is necessarily 
abelian |14[ Theorem 11].) An expansion G of an ordered group is called co set- minimal 
if every subset of G definable in G is a finite union of cosets of definable subgroups, 
intersected with intervals. A theory expanding the theory of ordered groups is said to 
be coset-minimal if all its models are. (See [15.,.) Now by [79l Theorem 5.3], the theory 
of G is coset-minimal iff the theory of the expansion of G by constant symbols for the 
elements of G is quasi-o-minimal, and in this case G is an expansion of an ordered group 
elementarily equivalent to either (Z", <, +) or (Z" x Q, <, +), for some n. 
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Part (2) of the previous corollary shows in particular that Presburger Arithmetic, 
i.e., the theory of the ordered group (Z, <, +) of integers, has the VC 1 property, and 
hence is dp-minimal. By Theorem 10 in [14 , (Z, <,+) has no proper quasi-o- minimal 
expansions. One can strengthen this statement: 

Proposition 6.6. No proper expansion of (Z. <, +) is dp-minimal. 

The proof is the same as in |T3j , replacing the use of [TH Theorem 7] by a result from 
[HI]; we state the latter employing some convenient terminology from [H]: Let {X, <) 
be a linearly ordered set. We say that two subsets S, T of X are eventually equal (in 
symbols: 5 « T) if there is some a ^ X such that S D {a, +oo) = T n (a, +oo). Clearly 
w is an equivalence relation on subsets of X. We say that a family of subsets of X is 
eventually finite if it is partitioned into finitely many classes by «. 

Lemma 6.7 (Simon 92, Lemma 2.9]). Suppose T is dp-minimal. Let M \= T and let 

ip{x;y) be a partitioned C{M) -formula where \x\ ~ 1. Then is eventually finite. 

For the benefit of the reader we now indicate the details of the proof of Proposition l6.6l 
Let Z he a. proper expansion of (Z, <, +). By a theorem of Michaux and Villemaire [55] 
(and an easy extra argument, given in the proof of [131 Theorem 10]), there is a subset 
[/ of Z which is definable in Z but not definable in (Z, <, +). By Simon's lemma, the 
family {a -\- U : a G Z} is eventually finite; thus the subgroup A of Z consisting of all 
a G Z such that a -\- U fa U is non-zero, so A = aZ for some positive integer a. Let V 
be the union of all cosets of A which contain arbitrarily large elements of U; then V is 
definable in (Z, <, +), hence it suffices to show that U V. As a -I- [/«[/, we can take 
a G Z such that for every m € Z with u > a we have a -\- u Cz U u £ U. One now 
proves easily that for every u £ Z with u > a we have u €z U u (z V. □ 

So for example, the expansion of (Z, <, +) by the set — {6" : n > 0} of powers of 
a natural number > 1 is not dp-minimal, as is the expansion of (Z, <, +) by the set of 
factorials or by the set of Fibonacci numbers. In all these examples, the corresponding 
expansion of (Z, <,-|-) has quantifier elimination in a natural expansion of {<,-\-,U} 
(see [inilTS]) and is NIP (as will be shown elsewhere). 

6.3. Weakly quasi-o-minimal theories. In [S3], T is called weakly quasi- o-minimal 
if for any M |= T, any definable subset of M is a finite Boolean combination of convex 
subsets of M and 0-definable sets. Every weakly quasi-o-minimal theory is NIP [53l 
Theorem 2.3]. In fact, the proof of Theorem 16.41 (mutatis mutandis) also shows more 
generally: 

Theorem 6.8. All weakly quasi-o-minimal theories have the VC 1 property. 

This observation can be used to strengthen [92, Proposition 4.2], where it is shown 
that the complete theories of colored linearly ordered sets with monotone relations 
(shown to be NIP in [85]) are dp-minimal. A binary relation i? on a set X is said to be 
monotone with respect to a linear ordering < of X if 

x' < xRy < y' x'Ry' for aU x, x' , y, y' S X. 

A colored linearly ordered set with monotone relations is a structure of the form M = 
{M, <, {Ci}ig/, {Rj}j^j) where < is a linear ordering on M, the Ci are unary predicates 
("colors"), and the Rj are binary relations which are monotone (with respect to <). It 
was shown by Simon [92l Proposition 4.1] that every such colored linearly ordered set 
with monotone relations has quantifier elimination provided that each 0-definable subset 
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of M is given by one of the predicates Ci and each monotone 0-definable binary relation 
is given by one of the Rj. 

Proposition 6.9. Let M he a colored linearly ordered set with monotone relations as 
above. Then T = Th(Af) is weakly quasi- o-minimal, and hence has the VC 1 property. 

Proof. By the result of Simon just quoted, we may assume that M admits quantifier 
elimination. Now for each b G M and j € J the set 

{x e Af : M h xRjb} 

is an initial segment of Af, and 

{x e Af : M 1= bRjx} 

is a final segment of M (i.e., its complement is an initial segment of Af). Hence any 
definable subset of M is a finite Boolean combination of initial segments of M and 
0-deflnable sets, so T is weakly quasi-o-minimal. □ 

As in [85l [92] this leads to a result for (partially) ordered sets of finite width. Let 
P — (P, <) be an ordered set, i.e., a set P equipped with an irreflexive, asymmetric and 
transitive binary relation < on P. A subset ^ of P is an antichain if for all a ^ a' in 
A, neither a < a' nor a' < a holds, and C C P is a chain if for all c 7^ c' in C, either 
c < c' or c' < c. The width of P is defined to be the supremum of the cardinalities of 
antichains in P, and denoted by width(P). (Dually, the height of P is defined to be 
the supremum of the cardinalities of a chain in P, denoted by height(P).) A colored 
ordered set is a structure P — (P, <, {Ci)i^i) where (P, <) is an ordered set and each 
Ci is a unary predicate. 

Corollary 6.10. Let P = {P^ <,{Ci)i<^i) he an infinite colored ordered set of finite 
width. Then YC^^^^\m) = m for every m. 

Proof. Let n ~ width(P) and let i, j range over [n]. By Dilworth's Theorem there is a 
partition P = Pi U • • • U P„ of P into disjoint chains Pi. Define a linear ordering ^ on P 
by setting a ^ & iff either a,b € Pi for some i and a < b, or a E Pi, b € Pj with i < j. 
For all i, j the binary relation 

R^J := {(a, &) e P : 3a' e P^, b' £ P^ : a ^ a' < b' ^ b} 

is monotone with respect to -< , and the original ordering < is 0-definable in the linearly 
ordered set with monotone relations (P, -<, {Rij)i,j), noting that a < & iff aRaoRijbRjjb 
for some i and j. The claim now follows immediately from Proposition 16.91 □ 

Question. Is every ordered set of finite width VC 1? 

By an interpretability argument, the previous corollary also leads to a (perhaps non- 
optimal) bound on the VC density for those distributive lattices with NIP theory. By [85l 
Theorem 6] these are exactly the distributive lattices of finite breadth. From Section [2^ 
recall that a semilattice (i. A) has breadth at most d if for all &i, . . . , 6^+1 G L there is 
some i G [d+1] such that 61 A • • • A bd+i = bi A ■ ■ ■ bi ■ ■ ■ A bd+i, and the smallest such d 
(if it exists) is called the breadth of (P, A). 

Corollary 6.11. Let L — (L, A,V) be an infinite distributive lattice of breadth d. Then 
.^i^Th{L)^^^ < dm for every m. 
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Proof. Let P — {P, <) be an ordered set of width d, and let A{P) be the set of antiehains 
of P (so each element of A{P) is a subset of P of size < d). For A, A' e A{P) let AAA' 
denote the set of minimal elements oi AU A' and AV A' the set of maximal elements of 
AUA'; then AAA', AvA' e A{P), and {A{P), A, V) is a distributive lattice of breadth d. 
Moreover, one can choose P such that the given distributive lattice L is isomorphic 
to A{P) j84| Theorem 3]. Since A{P) is interpretable in P on a definable subset of P"^, 
we have vc"'"'^^-'^-' (m) < vc^'^'^^'' (dm) by Corollarv 13.171 and hence vc"""^^-^-* (m) < dm by 
the previous corollary. □ 

In [92] it is shown that the complete theory of each infinite tree T (viewed as an 
ordered set) is dp-minimal. Here, a tree is an ordered set T = (T, <) with the property 
that for each t E T the set {t' £ T : t' < t} is linearly ordered (by the restriction of <). 

Problem. Determine the VC density function of each (infinite) tree. 

(It is known [75] that a tree T is stable iff T has finite height, and then T is superstable 
of U-rank < height(r), so conceivably, the methods of ^ could be applied.) 



7. A Strengthening of VC d, and P-adic Examples 

In this section we first introduce a strengthening of the VC d property defined and 
studied in Section [S] and we prove a more precise version of Theorem 15.71 for strong 
VCd structures. The extra precision afforded by this theorem is useful in situations 
where vc(l) = 1, yet we can only prove the YCd property for some d > I. This is the 
case for P-minimal theories, which are discussed in the last subsection, where we prove 
Theorem 1 1 . 2 1 from the introduction. 

7.1. The strong VCd property. In the following M is a structure in a language C 
and A = A{x;y) is a finite non-empty set of partitioned £-formulas. Let T = {J-i)i,=i 
be a uniform definition of A(x; i?)-types over finite sets, where 

Recall from Definition 15.11 above that this means the following: for every finite set B C 
M'^I and q e S'^{B) there are 6i, . . . , 6^ e i? and some i e / such that J-i{y; &i, . . . , b^) 
defines q. If in addition for every choice of 6i, . . . , 6^ e Af'^' and i e /, the set 

p,{x;bi,...,bd) ■.^{ip{x;b):ipeA, 6eAfl«l, M |= ^,(&; &i, . . . , 

of £(Af )-formulas is consistent (with M), then we say that is a coherent definition 
of A{x; B)-types over finite sets. (In this case, every restriction of pi{x;bi, . . . , bd) to a 
finite parameter set B C M^^^ extends to a complete A(x; B)-type, but J^i{y\ &i, . . . , bd) 
does not in general define such an extension.) 

Remark. Often all our defining formulas have the syntactic form 
'Pt{y;yi,---,yd) ^yx{xi{x;yi,...,yd) v{x;y)) 

where Xi is an ^-formula. In this case, the coherency condition for J- is automatically 
satisfied provided Xi^{x]b) ^ for all i € / and b € (Af'^')''. For example if 5a has 
breadth d and is d-consistent, then A has a coherent definition of A(a;; i?)-types over 
finite sets with d parameters. (See Lemma [521) 
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We say that M has the strong VC d property if any A{x;y) with |x| = 1 has a coherent 
definition of A(a;; _B)-types over finite sets with d parameters. Clearly, the strong YCd 
property is a property of the elementary theory of M. We say that a theory T has the 
strong VC d property if every model of T has the strong VC d property. 

Remark. Suppose |a;| = 1 and J- = {J-iji^i is a coherent definition of A(x; -B)-types 
over finite sets in iW, where J-i ~ iy^i)<peA- Then for every A' C A, the families 
:— {ipi)ip£A' form a coherent definition of A'{x; i?)-types over finite sets in M. This 
shows in particular that in order to check that M has the strong VC d property, one 
may restrict oneself to sets of /^-formulas A which are closed under negation. 

In the rest of this subsection we assume that M is infinite. We have the following 
result on counting types in structures with the strong VC d property: 

Theorem 7.1. Suppose that M has the strong YCd property, and Zei r G M such that 
TT'^{t) ~ 0{t^) for every A{x;y) with \x\ = 1. 

Then we have 

nlit) = 0(t'^(l^l-i)+'-) for every A{x; y). 

The proof of Theorem 17. II proceeds by counting types via sets of representatives (and 
with an induction supported by the strong YCd property): given a finite set B C Afl^l, 
we say that R C M'^' is a set of representatives for S^{B) if for every q £ S'^{B) there 
is a € R realizing q. Equivalently, for every a G Af'^' there is a e i? such that for every 

e A and every b e B, M \^ ip{a; b) if and only if M ^ ip{a; b). Thus \S^{B)\ < K 
iff there is a set of representatives for S^{B) of size at most K. 

Proof. The proof is similar to that of Theorem 15.71 We again induct on m = |a;|, with 
the case m — 1 holding by hypothesis. For the inductive step write x — {xo,x') where 
x' = and let A(x;y) be given. We may assume that A is closed under 

negation. As in the proof of Theorem 15.71 let 

AQ{xo;x',y) = {<f{xo;x',y) : <f{x;y) E A}. 

By the strong VC d property applied to Aq , we can take finitely many families 

J', = {^p^{x',y;yl,...,yd))^^^ {i e I) 

of ^-formulas with the following two properties: for any a' € M™, any finite B C Af 
and any q £ S^°{a'B), there are bi, . . . ,bd £ B and i £ I such that J^i{a' ,y;bi, . . . , bd) 
defines g, i.e., for aW ip £ A, b £ B: 

ip{xo;a',b) £ q <^=^ M \^ ipi{a ,b;bi, . . . ,bd); (7.1) 

and for all i£ I, a' £ A//™ and b £ (Af I"!)'', the set 

Pi{xo;a',b) := {ip{xa-a',b):if£A, b £ M^y^ , M ^ ip,{a' Mb)] 

of £(A/)-forniulas is consistent (with M). In the rest of this proof let tp range over A 
and i over /. For each i, let 

Aj(a;';y,yi,...,yd) = {ipi[x' \y,yi, . . . ,yd) ■ v{x;y) £ A] 

and apply the inductive hypothesis to each A^. Thus there are constants Kt such that 
for any finite C C (Afl*'l)('*+^) there is a set of representatives for S'^^{C) of size at most 

_^,|(^|rf(m-l)+r^ 



VC DENSITY IN SOME NIP THEORIES, I 



49 



Now let a finite B C M'^' be given. We let b range over B and b = (&i, . . . , 6^) over 
B'^. For each b and each i, let Ri{Bb) be a set of representatives for S^^{Bb). Thus for 
any a' e Af™ and i there is some a G Ri{Bb) such that for any ip and 6, 

M h<^^(«';6,S) ^ M \= ^,{a;b,b). (7.2) 

Notice that there are \B\'^ sequences b, and \Bb\ — \B\ for each b. As above, we may 
suppose \R,{Bb)\ < Ki\Bb\'^'^"'-^^+'' = K^\B\'^'^"'-^^+'^ . 

For each i, given a G Af™ and 5, let (5^ G Af realize the restriction of the type 
Pi{xo] a,b) to the (finite) parameter set aB. Let 

- {i^^c^.b^a) ■■ieI,beB'',ae R^{Bb)} 

and observe that 




Thus we are finished once we have shown that i?A is a set of representatives for S {B). 
For this, let a = {ao,a') G Af^+™ be given; we need to show that there is an element 
((5j ai^a) G i?A such that for every ip and every 5, 

Mh^(a;&) ^ Mh^('5.,„,b,a;&)- (7.3) 

Let g = tp^°{ao/a'B) G S"^°(a'B). Take i and 6 such that ((7?T|) holds for all and aU 
6, and then take a representative a G Ri{Bb) for tp^* (a'/i?6). Note that by (|7.ip and 
since A is assumed to be closed under negation, for each (p and 6, either A/ ^ ipi{a' , b; b) 
or Af 1= i/ji^a' ,b;b), where ip G A is equivalent to -^ip; hence also either M \= pi{a;b,b) 
or M ^ b,b), by ([7^ . and thus 

M^^,{a;b,b) ^ M^^{S^^^j;a,b), (7.4) 

by choice oi ^-j^. Combining (|7.ip . (17. 2p and (17. 4p now yields (|7.3I) as required. □ 

In each of the cases treated in Theorems 16.11 and 16.41 one can show that the theory 
in question has, indeed, the strong VC 1 property. Since this is of limited interest for 
computing VC density (the VC 1 property already gives the optimal result vc(m) = m), 
we do not give the details, and instead now turn to an application of Theorem 17.11 to 
p-adic examples. 

7.2. P- minimal theories. Let £iings be the language of rings, let Cp — £iings U {P„ : 
n > 1} where the P„ are unary predicates, and let £ be a language containing Cp. Here 
and below, p is a fixed prime number. Let pCF denote the £p-theory of Qp, where each 
P„ is interpreted as the set of nth powers in Qp: 

By a theorem of Macintyre j^O], pCF has elimination of quantifiers. Following [39], an 
£-theory T containing pCF is called P -minimal if, in every model of T, every definable 
subset in one variable is quantifier-free definable just using the language Cp. (In fact, 
the setting of [35] also allowed for p-adically closed fields of arbitrary fixed p-rank, and 
our methods here could be adjusted to that.) 

By [26], a motivating example of a P- minimal theory is the theory pGF^^ first in- 
vestigated by Denef and van den Dries [35]. This is the theory of the p-adic numbers 
equipped with, for every n > and power series ai.X" G Qp[[X]] such that \au\ — >■ 
as 1 1^1 oo, a function symbol / of arity n taking value identically zero off Z^, and 
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such that f{x) — '^^a^x'^ for all x £ Z^. (Here, X — {Xi, . . . ,Xn), \a\ denotes the 
p-adic norm of a G Qp, v — (z^i, . . . , v^) € N" is a multi-index, \i>\ = t^i + • • • + i>n, and 

Our main result about VC density in P-minimal theories is: 

Theorem 7.2. Let T be a P-minimal C-theory with definable Skolem functions. Then 
T has the strong VC2 property, and any finite set A(x;y) of C- formulas has dual VC 
density at most 2\x\ ~ 1. 

Before proving this theorem, we introduce some notation and establish some auxiliary 
facts. We fix a model K of pCF, with valuation v. K ^ T^. We view Z as a convex 
subgroup of r, by identifying 1 with v{p). In the following, by a ball in K we always 
mean a closed ball, i.e., a set of the form 

B = Bp{a) = |a; e i^T : v{x — c) > p] where c £ K and p G T. 

Its radius, denoted rad(_B), is p. By convention Tad{K) := — oo. Let B denote the set 
of all balls in K. There is a natural semilinear partial order on B, with B < B' ii 
and only ii B ^ B' . A ball B = Bp{a) as above has a unique immediate predecessor, 
namely Bp^i{a), and p immediate successors, namely Bp^i{ai) where ai — a + ir for 
i = 0, . . . ,p — 1] here r is an arbitrary element of K with v{r) = p. Thus, if we form a 
graph with vertex set B, with vertices B, B' adjacent if and only if one of B, B' is an 
immediate successor of the other in the partial order, each of its connected components 
is an unrooted tree of valency p + 1. For B, B' G B we write dist(i3, B') — d ii B and 
B' are at distance d in this graph; for each B G B, there are {p + I)'' balls at distance 
d to B, and /3d ■— X]iLo(^' + 1)' = piiP ^ lY'^^ — 1) balls at distance at most d to B. 
Note that dist is a metric on each connected component of B. 

Lemma 7.3. Let A C_ K be finite, A ^ Then there at most \A\ — 1 distinct balls of 
the form i?i,(a-fc)(a) where a,b G A, a ^ b. 

Proof. We may assume |^| > 1. For a e A set 

/la := 1 + max {v{a — b) : b G A, a 7^ fe}. 

Let Ba be the smallest connected subgraph of B containing all Bf^^{a), a G A. Note 
that the balls Bp^{a), a G A, are pairwise disjoint; in particular, they are the leaves of 
the tree Ba- The balls By(a-b){a), where a,b G A, a ^ b are vertices of Ba, and all but 
one of them has degree greater than 2. Now use the fact that any (undirected) tree with 
finite vertex set V with |y| > 1 has 2 + X]t)GVdog(t))>2(^^s(^) ~ 2) leaves. (This follows 
immediately from the well-known formula 2{\V\ — 1) = X^usv '^es(''^)-) 

We also recall the following basic fact (a consequence of the Newton formulation 
of Hensel's Lemma, see [39j Lemma 2.3]) about the subgroups P^ = P„ \ {0} of the 
multiplicative group = K \ {0} of K: 

Lemma 7.4. Suppose n > 1, and let x,y, a G K with v{y — x) > 2v{n) +v{y ~ a) . Then 
{x-a){y-a)-^ G P„^ 

Suppose now that T is an >C-theory satisfying the hypothesis of Theorem 17.21 and 
K \= T. Employing definability of Skolem functions and the explicit description of 
immediate predecessors and successors in the partial order of B given above, one easily 
shows, by induction on d: 
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Lemma 7.5. Let {Bb)beK'n be a ^-definable family of subsets of K . Then there exist 
^-definable functions Ci,ri: K"^ — >■ K, i e N, with the following property: if b ^ K"^ is 
such that B = Bb is a ball in K , then the balls i?^, (;,)•) (c; (6)), where « = 1, . . . , Pd, are 
exactly the balls of distance at most d to B . 

The assumption of definable Skolem functions also guarantees that any model of 
the theory T has cell decomposition [7D]. Let A{x;y) be a finite set of /^-formulas, 
where |a;| — 1, closed under negation. Then there are integers N,n > 0, and for each 
i = 1, . . . ,N there are 0-definable functions fi, gi, Ci : i^'*'' — )■ K and elements Xi of a 
fixed set of representatives of the cosets of the subgroup of with the following 
properties: for any ip £ A and b € if'^l, the set ip{K;b) of realizations of ip{x;b) is a 
finite union of some of the cells Ui{b), . . . , UNib) defined by the data given above, i.e., 
sets of the form 

U,{b) = {xeK: v{f,ib))a,iv{x - c,{b))a,2v{g,{b)) k Pn{K{x - (7.5) 

where each symbol Oij is <, <, or no condition. Note that this includes the cases where 
Ui{b) — {ci{b)} is a singleton, or where 

a,{b) = {xeK: PniMx - cdb)))}. (7.6) 

The center of the cell Ui{b) is given by Ci{b). Since the value group of K has smallest 
positive element v{p), using the equivalences 

v{a) < v{a') v{pa) < v{a') 

v{a) < v{a') -4=> v{a) < v{pa'), 

valid for all a, a' E K, not both zero, one sees that we may assume that in (|7.5I) . On is < 
or no condition, and Oi2 is < or no condition. From now on, we assume for convenience 
that all our cells have this particular form. 

Based on the above data for the cell decomposition, we now describe a uniform 
definition of A{x; _B)-types over finite sets in K, in terms of the graph of balls B. A 
special ball is a ball having one of the following forms: 

Bvic,{b)-cj{b')){ct{b)), B^(^f.(^t)){ci{b)), By^g^(^b)){ci{b)) {b,b' e K^^^). 

Note that each special ball can be defined by using at most two parameter tuples. 
We say that B„(c,(f,)_c,(b'))(ci(^)), B^(f,{b))ici{b)) and B^,(g^(b))(ci(6)) are special balls 
defined over {b, b'}. We also say that a special ball is defined over a subset B oi K ii it 
is defined over {b, b'} where b, b' e B. By Lemma [7.31 given a finite B C K, there are 
no more than 3A^ • — 1 special balls defined over B. 

Let us say that a ball B' is near a ball B if dist(i3, B') < n + 4:v{n) + 2; each ball has 
(3 := /3ri+4i,(„)+2 balls near it. In particular, given 6i,&2 G there are at most M := 
(67V -!)■ 13 baUs near special baUs defined over {&i, 62}. Set /(^^ := [M] = {1, . . . , M} 
and 7*^^-' = I*^^^ := [N]. By Lemma [7.51 there are £-formulas Xi (2;; 2/1, 2/2), i G such 
that for each 61, 62 G K^^^ the formulas xA^'^ bi, ^2) define exactly the balls near special 
balls defined over {61, 62}- For each i e I*^^^ and G A define 

V^t^\y,yuy2) := yx{xtix;yi,y2) v{x;y))- 

So for b, bi, 62 G K^^^ we have: 

K ip['\b;bi,b2) ^ ^{K■b)^X^{K■MM)■ 
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For each i g /^^^ = /(^^ and g A set 

(2) 

Vi {y;yi) ■= v{ct{yi);y) 

and 

^?\yiyi) Vx(F/v(A.i(x- c,(yi))) (p{x;y)). 

Now set J'p' i^\^\y;yi,y2))veA for each i e /(j), j = 1,2,3. The first part of 
Theorem 17.21 will be proved once we show the following: 

Claim. (J-p'^) is a coherent definition of A(a;; i?)-types over finite sets. 

Since the coherency condition is obviously satisfied, it is enough to show that is a 
uniform definition of A(a;; i?)-types over finite sets. For this let B C K^^^ be finite and 
non-empty, and let q e S'^{B). Let 

c(B) := {c,(6) : 6e B, i = 1,...,N} 

be the set of centers of the cells Ui{b). In the following we let i range over [N] = 
{!,..., iV} and b (possibly with decorations) over B. By a "special ball" we always 
mean a special ball defined over B, and similarly a "near ball" is a ball near a special 
ball (defined over B). 

(2) 

We first eliminate two special cases (which are taken care of by the families ( J^j ) 
and (T-'^'')): Suppose first that n c{B) ^ 0, say Ci(6i) G q^ for some i and hi e B. 

(2) 

In this case, with such choice of i and &i, J^- (y; 6i) defines q. Similarly, if \c{B)\ = 1, 
and for all i and b with q^ C 1/^(6), the condition On is vacuous and 5^(6) = 0, then all 
such cells Ui{b) have the form as in (|7.6p . and for suitable i and &i, T-^\y; 61) defines q. 
So from now on we may assume that: 

(a) q^ is disjoint from c{B); and 

(b) if c{B) is a singleton, then for some i and 6 with q^ C [7^(6) the condition On 
is < or gi{b) ^ 0. 

Under these assumptions, it is enough to show: there is a near ball D such that D C q^ . 
We first note: 

Lemma 7.6. Let a € K \ c{B), and let Bi be a ball containing a which is maximal 
subject to the condition Bi n c{B) — 0; that is, Bi = Bs{a) where 

S = 1 + max {v{a — c) : c e c(_B)}. 

Let also Bq := B5+2D(n)('2)- Then for all x & K and c G c{B) we have: 

(1) a; e Bi v{x — c) = v{a — c); 

(2) xeBo^ix- c){a - c)-i e Z',,. 

particular, if a cell Ui{b) contains a then it contains Bq. 

The proof of (1) is obvious, and to deduce (2) from (1) use LemmalTjH 

Let a € K realize q, and define 6, Bq and Bi as in the previous lemma. Also, take 
c S c{B) such that 5 = 1 + v{a — c). 

Lemma 7.7. Let B2 be a ball. Then 

(1) B2 properly contains Bi iff it contains both a and c; 

(2) if B2 contains c but not a, then dist(i?i, B2) = rad(i?2) — S + 2. 
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The proof of (1) is clear, and for (2) note that if c G -B2 and a ^ i?2, then Bi,B2 C 
Bs-iia). 

We first assume that there is a special ball E such that dist(_Bi, E) < 2v{n) + n + 1. 
Then by Lemma 17.61 D := Bq = Bg_^2v{n){'^) is contained in those cells Ui(b) which 
contain a; hence D C . Also, dist(il', E) < n + Av{n) + 2, so D is near E. Hence the 
ball D has the required properties. 

So from now on, we may suppose that for any special ball E we have dist(_Bi, E) > 
2v{n) + n + 1. We distinguish two cases: 

Case 1: there is a special ball which contains Bi. We let C be the smallest such special 
ball, with radius p. We have c € C (since C properly contains Bi) and hence C = Bp{c). 
The idea now is to replace a and the ball D — Bq hy another realization a' E C oi q and 
a ball D' which is contained in and near the special ball C. As F is a Z-group, there is 
a unique S' €T such that 

p + 2v{n) + l <S' < p + 2v{n) + l + n and S' = Smodn. 

By assumption we have 

2v{n) +n + l< dist(Si, C) = S- p 

and hence S > S' > p + 1. Now choose d E P„ with v{d) = 5' — 5. (From now on until 
the end of the proof of Theorem 17.21 we temporarily suspend our promise of d always 
denoting a natural number.) Put 

D' := Bs>+2v{n)+iia') where a' d{a - c) + c. 

Then D' is contained in the special ball C, and D' is near C. Indeed, D' C Bs'-i{a'), 
and these two balls are at distance 2v{n) + 2. The latter ball also contains c, so 

dist{Bs'-i{a'),C) = dist{Bs'-i{c},Bp{c)) = 5' - I - p < 2v{n) + n, 

hence dist(Z3', C) < 4:v{n) + 2+n. Thus D' has the right properties, provided we manage 
to show: 

Claim 1. Let Ui{h) be a cell as in (|7.5|) which contains a. Then D' C Ui{b). 

Towards the proof of this claim, we first show two auxiliary claims: 
Claim 2. Let c' G c{B). Then 

'(5'-l if w(c-c') > w(a-c), 

v{c — c') < p < 5' — 1 otherwise. 



v{a' - c') 



Proof. If v{c — c') > v{a — c) then 

vie - c') > v{a - c) ^ 5 - 1 > S' - I 

and hence 

v{a' ~ c') = v{dia - c) + (c - c')) - <5' - 1. 
So suppose v{c — c') < v{a — c). We have v{a — c) > v{c — c'), so v{a — c') = v{c — c') 
and hence a is contained in the special ball E := i?„(c_c') ('^)- fact, for every x G Bi 
we have 

v{x — a) > S = v{a — c) + 1 > v{a — c) > v{c — c') 

and hence Bi C E. By minimality of C thus C 'Z E. This yields (5' — 1 > p > v{c — c') 
and thus v{a' - c') = v{c -~ c') < 5' -1. □ 
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Claim 3. For every c' G c{B) we have v{a — c') > v{a' — c'). 

Proof. Certainly, v{a — c') > niin{v(a — a'),v{a' — c')}. But the minimum is always 
achieved by v{a' — c'), as 

v{a - a') = v{{d - l)(a - c)) ^ 6' - 1 > v{a' - c') 

by Claim 2. □ 

By Claim 2 and Lemma l7.6l (applied to a' in place of a), in order to show Claim 1, it is 
enough to prove that a' e Ui{b). We abbreviate c' = Ci{h). Suppose that Dii is <. Then 
fi{h) ^ 0, and by Lemma FTHI (1), all elements x of Bi satisfy the condition v{fi{h)) < 
v{x — c'); hence C is contained in the special ball (c'), by the minimality of C. 

Since D' C C, all elements x of D' also satisfy v{fi{b)) < v{x — c'); in particular, of 
course, < v{a' — c'). If 0^2 is <, then by Claim 3, w(a' — c') < v{a—c') < v{gi{b)), 
as required. It remains to check that a — c' and a' — c' lie in the same coset of . We 
distinguish two cases. If v{c — c') < p then v{a — c) — 5 — l>p> v{c — c'), hence 
v{a — c') ~ v{c — c') and thus 

v{a — a') = (5' — 1 > 2w(n) + p> 2v{n) + via — c); 

therefore a—c' and a'—c' are in the same P^-coset, by Lemma [7.4l Suppose v{c—c') > p. 
Then by Claim 2 we have v{c — c') > v{a — c) =6—1 and v{a' — c') = 5' — 1. Now 
consider the special ball E := B^i^c_^f^{c). Note that a ^ E: otherwise v{c — c') = 
v{a — c) =5—1 and hence Bi = Bs{a) C E with dist(_Bi,i?) = 1, contrary to our initial 
assumption (made before Case 1). Thus, by Lemma 17.71 (2) and said assumption, we 
obtain 2v{n) + n + 1 < v{c — c') — S + 2. Hence 

v{c — c) > 2v{n) + 6 — 1 = 2v{n) + v{a — c) > 2v(n) + v(a' — c'), 

with the last inequality by Claim 3. So by Lemma |7.4[ a — c' and a — c are in the same 
-coset, as are a' — c and a' — c' . Certainly, as a' — c = d{a — c) and d E P^ , the 
elements a — c and a' — c lie in the same coset of . Hence a — c' and a' — c' also lie 
in the same coset of P^ . This finishes the proof of Claim 1, and hence of Case 1. □ 

Case 2: no special ball contains Bi. In this case, for every c' G c{B), the special ball 
C = P«(c-c')(c) does not contain a, so v{c' — c) > v{a — c) = S — 1 and hence 

v{a — c') = min{v(a — c), v{c — c')} = v(a — c) = 5 — 1. 

Since C is of distance greater than 2v{n) + n + l from Pi, by part (2) of Lemma 17771 we 
also obtain 

v{c-c')>6 + 2v{n)+n-l. (7.7) 

Similarly, since each special ball By(^g.(^i,-j-){ci{b)) does not contain a, we get 

v{g^{b)) > S + 2v{n)+n - 1, 

and since a ^ Byi^f.(^i,)){ci{b)), the condition On is vacuous for each i and & with a G Ui{b). 

Fix a special ball E of the form By(^g.(^{,^-){ci(b)) with minimal radius 7 = v{gi(b)), if 
there is such a special ball; otherwise let 7 = 00. Also, if |c(P)| > 1, let C be a special 
ball of the form Pt,(c-c')(c); where c' G c(P), with minimal radius p = v{c — c'); we set 
p = cx) if |c(P)| = 1. Note that by our general assumption (made before Lemma 17.61) . 
not both of 7 and p are 00. We now distinguish two subcases: 
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Case 2a: p — 2w(n) < 7. Let 5' such that 

p — 2v(n) — n < d' < p — 2u(n), S' = 6 mod n, 

choose d Q Pn with v{d) = S' — S, and set 

D' := i?5'+2u(n) (1') where a' d{a — c) + c. 

By ()7.7p we have S' > S. Moreover, for each c" e c{B) we have w(c— c") > p > S' — 1 ^ 
v{d{a — c)) and hence v{a' — c") = 5' — 1. Note that the ball i?5'_i(a') contains D' and 
is of distance 2v{n) + 1 to D'. The ball Bs'-i{a') contains c, hence 

dist(B5'_i(a'),C) <p-{S'-l)< 2v{n)+n+l 

and thus dist(_D', C) < 4u(n) + n + 2, so D' is near C. Let Ui{b) be a cell containing a; it 
remains to show that then a' G Ui{b). We already noted that condition On is vacuous. 
As to Oi2, suppose that condition is <. Writing c' = Ci{b) we then have 

v{a' -c') = S' -1< p- 2v{n) < 7 < v{g,{b)) 

as required. Finally, by (|7.7p and Lemma [7^ a — c' and a — c are in the same P^-coset, 
and since 

v{c -c')>p> 2v{n) + (5' - 1 = 2v{n) + v{a' - c'), 
the elements a' — c and a' — c' are also in the same P,^-coset. As a' — c — d{a — c) and 
a — c are in the same P^-coset, finally a' — c' and a — c! are in the same P^-coset, as 
required. □ 

Case 2b: p — 2v{n) > 7. In this case we let (5' e F be such that 

7 — 2v{n) — n < d' < J — 2v{n), S' = 6 mod n, 

and with this choice of 6' define d, a' and D' as in Case 2a. Note that p > 7, so for each 
c" £ c{B) we have v{c — c") > 6' — \ = v{d{a — c)) and hence v{a' — c") — 5' — 1. Since 

dist{Bs'-i{a),E) < j - {S' - 1) < 2v{n) + n + 1 

we see, similarly as in Case 2a, that dist(£'', E) < 4v(n)+n+2, so D' is near E. Let Ui{b) 
be a cell containing a, and suppose Di2 is <. Then v{a' — c') = 5' — 1 < 7 < v{gi{b)), 
and as at the end of Case 2a one sees that a' — c' and a — c' are in the same coset 

0fP„x. □ 

To complete the proof of the theorem, we apply Theorem [7jT] with r = 1. By what we 
have shown above, for every finite non-empty B C M'^', each type in S^{B) is uniquely 
determined by either a center Ci{b), where 6 e B, or a near ball. However, there are 
at most N\B\ = 0{\B\) centers, and at most {3N\B\ - 1) • ^ = 0{\B\) near balls; thus 
\S^(B)\ = o{\B\) as required. □ 

From Theorem 17.21 and CoroUarv 15 . 131 we obtain: 

Corollary 7.8. Every P-minimal theory with definable Skolem functions is dp-minimal. 

Remark 7.9. In |M1 Section 6], Dolich, Goodrick and Lippel already showed that pCF = 
Th((Q)p) is dp-minimal. By 3.6 of [35], the P-minimal theory pCF^^j^ has definable Skolem 
functions. (Formally, [221 handles the corresponding subanalytic structure on Zp, but 
the translation is straightforward.) Note that the proof of ^22, 3.6] takes place in the 
ground model Zp, where all elements are named by constants, so 'definable' means '0- 
definable', and curve selection, as stated there, gives definable Skolem functions. Hence 
the conclusion of Theorem 17.21 and Corollary [TjH apply to it and its reducts. (See also 
Lemma [3.61 ) Cell decomposition in pCF^^^ is also proved in |21) . 
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